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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Recombinant Biocatalysis , Inc. 

(ii) TITLE OF INVENTION: THERMOSTABLE PHOSPHATASES 

(iii) NUMBER OF SEQUENCES: 54 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Fish & Richardson P.C. 

(B) STREET: 4225 Executive Square, Suite 1400 

(C) CITY: La Jolla 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 92037 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: Windows 95 

(D) SOFTWARE: FastSEQ for Windows Version 2.0b 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US97/10784 

(B) FILING DATE: 19-JUN-1997 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/666,857 

(B) FILING DATE: 19-JUN-1996 

(A) APPLICATION NUMBER: 60/033,752 

(B) FILING DATE: 19-JUN-1997 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Haile, Ph.D., Lisa A. 

(B) REGISTRATION NUMBER: 38,347 

(C) REFERENCE/DOCKET NUMBER: 09010/015WO1 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 619/678-5070 

(B) TELEFAX: 619/678-5099 

(C) TELEX: 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGGGGGCA GGTCCGAAAA GG 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 31 base pairs 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
CCGAGGATCC TCACCGCCCC CTGCGGGTGC G 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 
CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGTTGGAT ATACTGCTTG TT 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 
CCGACGATCC TTATTTTTTA ACCAAATGTT CC 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
CCGACAATTG ATTAAAGAGG AGAAATTAAC TATGATGATG GAATTCACTC GC 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
CGGAGGATCC CTACAGTTCT AAAAGTCTTT TA 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

CCGACAATTG ATTAAAGAGG AGAAATTAAC TATGAGAACC CTGACAATAA AC 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
CCGAGGATCC TTACACCCAC AGAACCCTTA C 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGAAAGGA AAGTCTCTTG TT 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CCGAGGATCC TCAAGCTTCC TGGAGAATCA A 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGCCAAGA AATATCGCCG CT 52 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CCGAGGATCC TTAAGGCTTC TCGAGGTGGG GGTT 34 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid ^ 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CCGACAATTG ATTAAAGAGG AGAAATTAAC TATGTATAAA TGGATTATTG AGGG 54 
(2) INFORMATION FOR SEQ ID NO : 14 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CCGAGGACTA AACATAGTCT AAGTAATTAG C 31 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:15: 
CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGAGAATC CTCCTCACCA AC 52 
(2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

CCGAGGATCC TCACAGGCTC AGAAGCCTTT G 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGGAAAAC TTAAAAAAGT ACCT 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CCGAGGATCC TCACCGCCCC CTGCGGGTGC G 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 783 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...780 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 19: 

ATG AGG GGG AGC GGA GTG CGG ATA CTT CTC ACC AAC GAT GAC GGC ATC 
Met Arg Gly Ser Gly Val Arg lie Leu Leu Thr Asn Asp Asp Gly lie 
15 10 15 

TTT GCC GAG GGT CTG GGG GCT CTG CGC AAG ATG CTG GAG CCC GTG GCT 
Phe Ala Glu Gly Leu Gly Ala Leu Arg Lys Met Leu Glu Pro Val Ala 
20 25 30 

ACC CTT TAC GTG GTG GCT CCG GAC CGA GAG CGT AGC GCG GCC AGC CAT 
Thr Leu Tyr Val Val Ala Pro Asp Arg Glu Arg Ser Ala Ala Ser His 



35 



40 



45 



GCT ATC ACC GTT CAC CGC CCC CTG CGG GTG CGG GAG GCG GGT TTT CGC 192 
Ala He Thr Val His Arg Pro Leu Arg Val Arg Glu Ala Gly Phe Arg 
50 55 60 

AGC CCC AGG CTT AAA GGC TGG GTA GTG GAC GGT ACC CCG GCC GAC TGC 24 0 

Ser Pro Arg Leu Lys Gly Trp Val Val Asp Gly Thr Pro Ala Asp Cys 
65 70 75 80 

GTC AAG CTG GGC CTG GAG GTA CTT TTG CCC GAA CGT CCA GAT TTC CTG 288 
Val Lys Leu Gly Leu Glu Val Leu Leu Pro Glu Arg Pro Asp Phe Leu 
85 90 95 

GTT TCG GGC ATA AAC TAC GGG CCC AAC CTG GGT ACC GAC GTA CTT TAC 33 6 

Val Ser Gly He Asn Tyr Gly Pro Asn Leu Gly Thr Asp Val Leu Tyr 
100 105 110 

TCC GGC ACC GTC TCG GCG GCC ATA GAA GGG GTA ATT AAC GGC ATT CCC 384 
Ser Gly Thr Val Ser Ala Ala He Glu Gly Val He Asn Gly He Pro 
115 120 125 

TCG GTG GCC GTA TCT TTG GCC ACG CGG CGG GAG CCG GAC TAT ACC TGG 432 
Ser Val Ala Val Ser Leu Ala Thr Arg Arg Glu Pro Asp Tyr Thr Trp 
130 135 140 

GCG GCC CGG TTC GTC CTG GTC CTG CTG GAG GAA CTG CGA AAA CAC CAA 4 80 

Ala Ala Arg Phe Val Leu Val Leu Leu Glu Glu Leu Arg Lys His Gin 
145 150 155 160 

CTG CCC CCA GGA ACC CTG CTC AAC GTC AAC GTG CCC GAC GGG GTG CCC 52 8 

Leu Pro Pro Gly Thr Leu Leu Asn Val Asn Val Pro Asp Gly Val Pro 
165 170 175 

CGC GGG GTC AAG GTG ACC AAA CTG GGA AGC GTA CGC TAC GTC AAC GTG 576 
Arg Gly Val Lys Val Thr Lys Leu Gly Ser Val Arg Tyr Val Asn Val 
180 185 190 

GTA GAC TGC CGC ACC GAC CCT CGG GGG AAG GCT TAC TAC TGG ATG GCG 624 
Val Asp Cys Arg Thr Asp Pro Arg Gly Lys Ala Tyr Tyr Trp Met Ala 
195 200 205 

GGA GAA CCA TTG GAG CTG GAC GGC AAC GAC TCC GAA ACC GAC GTC TGG 672 
Gly Glu Pro Leu Glu Leu Asp Gly Asn Asp Ser Glu Thr Asp Val Trp 
210 215 220 

GCG GTG CGA GAA GGC TAT ATT TCC GTA ACA CCG GTC CAG ATC GAC CTT 720 
Ala Val Arg Glu Gly Tyr He Ser Val Thr Pro Val Gin" He Asp Leu 
225 230 235 240 

ACT AAC TAC GGC TTC CTG GAA GAA CTC AAA AAA TGG CGT TTC AAG GAT 768 
Thr Asn Tyr Gly Phe Leu Glu Glu Leu Lys Lys Trp Arg Phe Lys Asp 
245 250 255 

ATC TTT TCT TCT TAA 7 83 

He Phe Ser Ser 
260 

(2) INFORMATION FOR SEQ ID NO: 20; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 98 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1. . .795 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

ATG TTG GAT ATA CTG CTT GTT AAT GAT GAT GGC ATT TAT TCA AAT GGA 4 8 

Met Leu Asp lie Leu Leu Val Asn Asp Asp Gly lie Tyr Ser Asn Gly 
15 10 15 

TTA ATA GCT TTG AAG GAT GCA TTA TTG GAA AAA TTT AAT GCG AGG ATT 96 
Leu lie Ala Leu Lys Asp Ala Leu Leu Glu Lys Phe Asn Ala Arg He 
20 25 30 



ACT ATT GTA GCC CCA ACA AAT CAG CAG AGT GGT ATT GGT AGG GCA ATA 144 
Thr He Val Ala Pro Thr Asn Gin Gin Ser Gly He Gly Arg Ala He 
35 40 45 

AGT TTA TTC GAG CCG TTA AGG ATA ACT AAA ACC AAA TTA GCA GAT GGT 192 
Ser Leu Phe Glu Pro Leu Arg He Thr Lys Thr Lys Leu Ala Asp Gly 
50 55 60 

TCT TGG GGA TAT GCA GTT TCA GGA ACC CCA ACA GAT TGC GTT ATA TTG 240 
Ser Trp Gly Tyr Ala Val Ser Gly Thr Pro Thr Asp Cys Val He Leu 
65 70 75 80 

GGC ATT TAT GAG ATA TTA AAG AAG GTA CCT GAT GTA GTT ATA TCA GGA 2 88 

Gly He Tyr Glu He Leu Lys Lys Val Pro Asp Val Val He Ser Gly 
85 90 95 

ATA AAC ATT GGA GAA AAC CTT GGG ACT GAA ATA ACA ACT TCT GGA ACG 336 
He Asn He Gly Glu Asn Leu Gly Thr Glu He Thr Thr Ser Gly Thr 
100 105 110 

TTG GGG GCT GCG TTT GAA GGG GCC CAT CAT GGG GCT AAG GCA TTA GCA 3 84 

Leu Gly Ala Ala Phe Glu Gly Ala His His Gly Ala Lys Ala Leu Ala 
115 120 125 

TCA TCA CTC CAA GTT ACC TCT GAC CAT CTA AAG TTT AAA GAG GGG GAG 432 
Ser Ser Leu Gin Val Thr Ser Asp His Leu Lys Phe Lys Glu Gly Glu 
130 135 140 

ACC CCA ATA GAC TTC ACA GTC CCA GCA AGA ATT ACT GCA AAT GTT GTT 480 
Thr Pro He Asp Phe Thr Val Pro Ala Arg He Thr Ala Asn Val Val 
145 150 155 160 

GAG AAG ATG TTG GAT TAT GAT TTC CCA TGT GAT GTC GTC AAC TTA AAC 528 
Glu Lys Met Leu Asp Tyr Asp Phe Pro Cys Asp Val Val Asn Leu Asn 
165 170 175 

ATT CCA GAA GGA GCA ACA GAA AAG ACA CCG ATT GAA ATC ACA AGG TTG 576 
He Pro Glu Gly Ala Thr Glu Lys Thr Pro He Glu He Thr Arg Leu 
180 185 190 

GCA AGG AAA ATG TAT ACA ACA CAC GTT GAG GAA AGA ATA GAT CCA AGA 624 
Ala Arg Lys Met Tyr Thr Thr His Val Glu Glu Arg He Asp Pro Arg 
195 200 205 



GGG AGG AGT TAT TAT TGG ATT GAT GGG TAT CCT ATT TTA GAG GAA GAG 672 
Gly Arg Ser Tyr Tyr Trp He Asp Gly Tyr Pro He Leu Glu Glu Glu 
210 215 220 



GAA GAC ACT GAT GTC TAT GTT GTT AGA AGA AAG GGA CAT ATT TCT CTA 72 0 

Glu Asp Thr Asp Val Tyr Val Val Arg Arg Lys Gly His lie Ser Leu 
225 230 235 240 

ACC CCA TTA ACA TTA GAC ACA ACA ATT AAA AAT TTA GAG GAA TTT AAG 768 
Thr Pro Leu Thr Leu Asp Thr Thr lie Lys Asn Leu Glu Glu Phe Lys 
245 250 255 

AAA AAA TAT GAG AGA ATA TTA AAT GAA TGA 798 
Lys Lys Tyr Glu Arg lie Leu Asn Glu 
260 265 

(2) INFORMATION FOR SEQ ID N0:21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 765 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1. . .762 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

ATG ATG ATG GAA TTC ACT CGC GAG GGA ATA AAA GCT GCT GTA GAG GCA 48 
Met Met Met Glu Phe Thr Arg Glu Gly He Lys Ala Ala Val Glu Ala 
15 10 15 

CTT CAA GGG TTA GGA GAG ATC TAC GTA GTT GCC CCA ATG TTT CAA AGG 96 
Leu Gin Gly Leu Gly Glu He Tyr Val Val Ala Pro Met Phe Gin Arg 
20 25 30 

AGC GCA AGT GGA AGG GCA ATG ACC ATC CAC AGA CCT CTA AGG GCT AAA 144 
Ser Ala Ser Gly Arg Ala Met Thr He His Arg Pro Leu Arg Ala Lys 
35 40 45 

AGA ATA AGT ATG AAC GGT GCA AAA GCA GCC TAT GCT TTG GAT GGA ATG 192 
Arg He Ser Met Asn Gly Ala Lys Ala Ala Tyr Ala Leu Asp Gly Met 
50 55 60 

CCC GTT GAT TGC GTT ATC TTT GCC ATG GCC AGA TTT GGA GAT TTC GAC 240 
Pro Val Asp Cys Val He Phe Ala Met Ala Arg Phe Gly Asp Phe Asp 
65 70 75 80 

CTT GCA ATA AGT GGT GTA AAC TTG GGA GAA AAC ATG AGC ACC GAG ATA 2 88 

Leu Ala He Ser Gly Val Asn Leu Gly Glu Asn Met Ser Thr Glu He 
85 90 95 

ACG GTT TCC GGG ACT GCA AGC GCT GCA ATA GAG GCT GCA ACC CAA GAG 3 36 

Thr Val Ser Gly Thr Ala Ser Ala Ala He Glu Ala Ala Thr Gin Glu 
100 105 110 

ATC CCA AGC ATT CCC ATA AGC CTG GAA GTT AAT AGA GAA AAA CAC AAA 3 84 

He Pro Ser He Pro He Ser Leu Glu Val Asn Arg Glu Lys His Lys 
115 120 125 

TTT GGT GAG GGC GAA GAG ATT GAC TTC TCA GCT GCC AAG TAT TTC CTA 432 
Phe Gly Glu Gly Glu Glu He Asp Phe Ser Ala Ala Lys Tyr Phe Leu 
130 135 140 

AGA AAA ATC GCA ACG GCG GTT TTA AAG AGA GGC CTC CCC AAA GGA GTC 480 



Arg Lys lie Ala Thr Ala Val Leu Lys Arg Gly Leu Pro Lys Gly Val 
145 150 155 160 



GAT ATG CTG AAC GTC AAC GTC COT TAT GAT GCA AAT GAA AGG ACA GAG 
Asp Met Leu Asn Val Asn Val Pro Tyr Asp Ala Asn Glu Arg Thr Glu 
165 170 175 



528 



ATA GOT TTT ACT CGC CTG GCA AGA AGG ATG TAT AGG CCT TCT ATT GAA 
He Ala Phe Thr Arg Leu Ala Arg Arg Met Tyr Arg Pro Ser lie Glu 
180 185 190 



576 



GAG CGC ATA GAC CCA AAG GGG AAT CCC TAC TAC TGG ATA GTT GGA ACT 
Glu Arg He Asp Pro Lys Gly Asn Pro Tyr Tyr Trp He Val Gly Thr 
195 200 205 



624 



CAG TGC CCT AAG GAG GCA TTA GAG CCG GGA ACG GAT ATG TAT GTA GTT 
Gin Cys Pro Lys Glu Ala Leu Glu Pro Gly Thr Asp Met Tyr Val Val 
210 215 220 



672 



AAA GTT GAG AGA AAA GTT AGC GTG ACT CCA ATA AAC ATT GAT ATG ACA 
Lys Val Glu Arg Lys Val Ser Val Thr Pro lie Asn He Asp Met Thr 
225 230 235 240 



720 



GCA AGA GTG AAT TTA GAC GAG ATT AAA AGA CTT TTA GAA CTG 
Ala Arg Val Asn Leu Asp Glu He Lys Arg Leu Leu Glu Leu 
245 250 



762 



TAG 



765 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 816 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...813 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

ATG AGA ACC CTG ACA ATA AAC ACT GAC GCG GAG GGG TTC GTT TTG AGG 48 
Met Arg Thr Leu Thr He Asn Thr Asp Ala Glu Gly Phe Val Leu Arg 
15 10 15 

ATT CTC CTG ACG AAC GAC GAT GGA ATC TAC TCC AAC GGA CTG CGC GCC 96 
He Leu Leu Thr Asn Asp Asp Gly He Tyr Ser Asn Gly Leu Arg Ala 
20 25 30 

GCT GTG AAA GCC CTG AGT GAG CTC GGC GAA GTT TAC GTC GTT GCC CCC 144 
Ala Val Lys Ala Leu Ser Glu Leu Gly Glu Val Tyr Val Val Ala Pro 
35 40 45 

CTC TTC CAG AGG AGC GCG AGC GGC AGG GCC ATG ACG CTC CAC AGG CCG 192 
Leu Phe Gin Arg Ser Ala Ser Gly Arg Ala Met Thr Leu His Arg Pro 
50 55 60 

ATA AGG GCC AAG CGC GTT GAC GTT CCC GGC GCA AAG ATA GCC TAC GGA 240 
He Arg Ala Lys Arg Val Asp Val Pro Gly Ala Lys He Ala Tyr Gly 
65 70 75 80 



ATA GAT GGA ACT CCT ACT GAC TGC GTG ATT TTC GCC ATA GCC CGC TTC 
lie Asp Gly Thr Pro Thr Asp Cys Val lie Phe Ala lie Ala Arg Phe 
85 90 95 



288 



GGG AGC TTT GGT TTA GCC GTG AGC GGG ATT AAC CTC GGC GAG AAC CTG 336 
Gly Ser Phe Gly Leu Ala Val Ser Gly lie Asn Leu Gly Glu Asn Leu 
100 105 110 

AGC ACC GAG ATA ACA GTC TCA GGG ACG GCC TCC GCT GCC ATA GAG GCC 3 84 

Ser Thr Glu lie Thr Val Ser Gly Thr Ala Ser Ala Ala lie Glu Ala 
115 120 125 

TCA ACT CAT GGA ATT CCG AGC ATA GCG ATT AGC CTT GAG GTG GAG TGG 432 
Ser Thr His Gly lie Pro Ser He Ala lie Ser Leu Glu Val Glu Trp 
130 135 140 

AAG AAG ACC CTC GGC GAG GGT GAG GGG GTT GAC TTC TCG GTC TCG ACT 480 
Lys Lys Thr Leu Gly Glu Gly Glu Gly Val Asp Phe Ser Val Ser Thr 
145 150 155 160 

CAC TTC CTC AAG AGA ATC GCG GGA GCC CTC TTG GAG AGA GGT CTT CCT 528 
His Phe Leu Lys Arg He Ala Gly Ala Leu Leu Glu Arg Gly Leu Pro 
165 170 175 

GAG GGC GTT GAC ATG CTC AAC GTC AAC GTT CCG AGC GAC GCG ACG GAG 576 
Glu Gly Val Asp Met Leu Asn Val Asn Val Pro Ser Asp Ala Thr Glu 
180 185 190 

GAA ACG GAG ATA GCA ATC ACC CGC TTA GCC CGG AAG CGC TAC TCC CCA 624 
Glu Thr Glu He Ala He Thr Arg Leu Ala Arg Lys Arg Tyr Ser Pro 
195 200 205 

ACG GTC GAG GAG AGG ATT GAC CCC AAG GGC AAC CCC TAC TAC TGG ATT 672 
Thr Val Glu Glu Arg He Asp Pro Lys Gly Asn Pro Tyr Tyr Trp He 
210 215 220 

GTC GGC AAA CTT GTC CAA GAC TTC GAG CCA GGG ACA GAT GCC TAC GCC 720 
Val Gly Lys Leu Val Gin Asp Phe Glu Pro Gly Thr Asp Ala Tyr Ala 
225 230 235 240 

CTG AAG GTC GAG AGG AAG GTC AGC GTC ACG CCG ATA AAC ATA GAT ATG 768 
Leu Lys Val Glu Arg Lys Val Ser Val Thr Pro He Asn He Asp Met 
245 250 255 

ACT GCG AGG GTG GAC TTT GAG GAG CTT GTA AGG GTT CTG TGG GTG 813 
Thr Ala Arg Val Asp Phe Glu Glu Leu Val Arg Val Leu Trp Val 
260 265 270 

TAA 816 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1494 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...1491 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 



ATG AAA GGA AAG TCT CTT GTT AGC GGT CTG TTG TTG GGT CTT TTA ATT 
Met Lys Gly Lys Ser Leu Val Ser Gly Leu Leu Leu Gly Leu Leu lie 
15 10 15 

TTG AGC CTG ATT TCA TTC CAG CCA AGC TTT GCA TAC TCC CCA CAC GGC 
Leu Ser Leu He Ser Phe Gin Pro Ser Phe Ala Tyr Ser Pro His Gly 
20 25 30 

GGT GTC AAA AAC ATC ATA ATC CTG GTT GGA GAC GGC ATG GGT CTT GGG 
Gly Val Lys Asn He He He Leu Val Gly Asp Gly Met Gly Leu Gly 
35 40 45 

CAT GTA GAA ATT ACA AAG CTC GTT TAT GGA CAC TTA AAC ATG GAA AAC 
His Val Glu He Thr Lys Leu Val Tyr Gly His Leu Asn Met Glu Asn 
50 55 60 

TTT CCA GTT ACT GGA TTT GAG CTT ACT GAT TCC CTA AGT GGT GAA GTT 
Phe Pro Val Thr Gly Phe Glu Leu Thr Asp Ser Leu Ser Gly Glu Val 
65 70 75 80 

ACA GAT TCT GCT GCG GCA GGA ACT GCA ATA TCC ACT GGA GCT AAA ACG 
Thr Asp Ser Ala Ala Ala Gly Thr Ala He Ser Thr Gly Ala Lys Thr 
85 90 95 

TAT AAT GGT ATG ATT TCA GTA ACC AAC ATA ACC GGA AAG ATA GTT AAC 
Tyr Asn Gly Met He Ser Val Thr Asn He Thr Gly Lys He Val Asn 
100 105 110 

TTA ACA ACC CTA CTT GAA GTG GCT CAA GAG CTT GGG AAG TCA ACA GGG 
Leu Thr Thr Leu Leu Glu Val Ala Gin Glu Leu Gly Lys Ser Thr Gly 
115 120 125 

CTG GTC ACC ACA ACA AGG ATT ACC CAT GCA ACT CCA GCA GTT TTT GCG 
Leu Val Thr Thr Thr Arg He Thr His Ala Thr Pro Ala Val Phe Ala 
130 135 140 

TCC CAT GTC CCA GAT AGG GAT ATG GAG GGG GAG ATA CCC AAG CAA CTC 
Ser His Val Pro Asp Arg Asp Met Glu Gly Glu He Pro Lys Gin Leu 
145 150 155 160 

ATA ATG CAC AAA GTT AAC GTC TTG TTG GGT GGT GGA AGG GAG AAA TTC 
He Met His Lys Val Asn Val Leu Leu Gly Gly Gly Arg Glu Lys Phe 
165 170 175 

GAT GAG AAA AAT TTG GAG CTG GCC AAA AAG CAG GGA TAC AAA GTA GTT 
Asp Glu Lys Asn Leu Glu Leu Ala Lys Lys Gin Gly Tyr Lys Val Val 
180 185 190 

TTC ACG AAG GAA GAG CTT GAA AAA GTT GAA GGA GAT TAT GTC CTA GGA 
Phe Thr Lys Glu Glu Leu Glu Lys Val Glu Gly Asp Tyr Val Leu Gly 
195 200 205 

CTC TTT GCA GAA AGT CAC ATC CCT TAC GTA TTG GAT AGA AAA CCC GAT 
Leu Phe Ala Glu Ser His He Pro Tyr Val Leu Asp Arg Lys Pro Asp 
210 215 220 

GAT GTT GGA CTT TTA GAA ATG GCC AAA AAG GCA ATT TCA ATA CTC GAG 
Asp Val Gly Leu Leu Glu Met Ala Lys Lys Ala He Ser He Leu Glu 
225 230 235 240 

AAG AAC CCG AGC GGA TTC TTT CTC ATG GTT GAG GGC GGA AGG ATT GAC 
Lys Asn Pro Ser Gly Phe Phe Leu Met Val Glu Gly Gly Arg He Asp 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 



624 



672 



720 



768 



245 



250 



255 



CAT GCA GCC CAT GGA AAC GAT GTC GCA TCG GTT GTT GCA GAA ACT AAG 816 
His Ala Ala His Gly Asn Asp Val Ala Ser Val Val Ala Glu Thr Lys 
260 265 270 

GAG TTT GAC GAT GTT GTC AGA TAC GTG CTG GAA TAT CCG AAG AAG AGG 864 
Glu Phe Asp Asp Val Val Arg Tyr Val Leu Glu Tyr Pro Lys Lys Arg 
275 280 285 

GGA GAT ACC TTG GTA ATA GTG CTT GCC GAT CAC GAA ACT GGA GGT CTT 912 
Gly Asp Thr Leu Val lie Val Leu Ala Asp His Glu Thr Gly Gly Leu 
290 295 300 

GCA ATA GGT CTA ACG TAT GGA AAT GCA ATC GAT GAA GAT GCC ATA AGA 960 
Ala lie Gly Leu Thr Tyr Gly Asn Ala lie Asp Glu Asp Ala lie Arg 
305 310 315 320 

AAA ATA AAA GCA AGC ACG TTG AGG ATG CCC AAA GAG GTT AAG GCA GGG 1008 
Lys lie Lys Ala Ser Thr Leu Arg Met Pro Lys Glu Val Lys Ala Gly 
325 330 335 

AGT AGT GTA AAA GAG TCC TCA AAG GTA TGC CGG ATT TGT CCC AAC AGA 1056 
Ser Ser Val Lys Glu Ser Ser Lys Val Cys Arg lie Cys Pro Asn Arg 
340 345 350 

GGA AGA AGT CAG TAT ATT GAG AAT GCG CTG CAC TCG ACA AAC AAG TAT 1104 
Gly Arg Ser Gin Tyr lie Glu Asn Ala Leu His Ser Thr Asn Lys Tyr 
355 360 365 

GCC CTC TCA AAT GCA GTA GCC GAT GTT ATA AAC AGG CGT ATT GGT GTT 1152 
Ala Leu Ser Asn Ala Val Ala Asp Val lie Asn Arg Arg He Gly Val 
370 375 380 

GGA TTC ACC TCC TAT GAG CAT ACA GGA GTT CCA GTT CCG CTC TTA GCT 1200 
Gly Phe Thr Ser Tyr Glu His Thr Gly Val Pro Val Pro Leu Leu Ala 
385 390 395 400 

TAC GGT CCC GGG GCA GAG AAC TTC AGA GGT TTC TTA CAC CAT GTG GAT 1248 
Tyr Gly Pro Gly Ala Glu Asn Phe Arg Gly Phe Leu His His Val Asp 
405 410 415 

ACA GCA AGA TTA GTT GCA AAG TTA ATG CTC TTT GGA AGG AGG AAT ATT 1296 
Thr Ala Arg Leu Val Ala Lys Leu Met Leu Phe Gly Arg Arg Asn He 
420 425 430 

CCA GTT ACC ATT TCA AGC GTG AGC AGT GTT AAG GGA GAC ATA ACC GGT 1344 
Pro Val Thr He Ser Ser Val Ser Ser Val Lys Gly Asp He Thr Gly 
435 440 445 

GAT TAC AGG GTT GAT GAG AAG GAT GCC TAC GTT ACG CTC ATG ATG TTT 13 92 
Asp Tyr Arg Val Asp Glu Lys Asp Ala Tyr Val Thr Leu Met Met Phe 
450 455 460 

CTC GGA GAA AAA GTG GAT AAT GAA ATT GAA AAG AGA GTC GAT ATA GAC 144 0 
Leu Gly Glu Lys Val Asp Asn Glu He Glu Lys Arg Val Asp He Asp 
465 470 475 480 

AAC AAC GGC ATG GTT GAC TTA AAT GAC GTC ATG TTG ATT CTC CAG GAA 14 88 
Asn Asn Gly Met Val Asp Leu Asn Asp Val Met Leu He Leu Gin Glu 
485 490 495 

GCT TGA ^4 5^ 

Ala 



(2) INFORMATION FOR SEQ ID NO: 24: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1755 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...1752 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

ATG CCA AGA AAT ATC GCC GCT GTA TGC GCC CTG GCC GCT TTG TTA GGG 48 
Met Pro Arg Asn He Ala Ala Val Cys Ala Leu Ala Ala Leu Leu Gly 
15 10 15 

TCG GCC TGG GCG GCC AAA GTT GCC GTC TAC CCC TAC GAC GGA GCC GCT 96 
Ser Ala Trp Ala Ala Lys Val Ala Val Tyr Pro Tyr Asp Gly Ala Ala 
20 25 30 

TTG CTG GCG GGG CAG CGC TTC GAT TTG CGC ATA GAA GCC TCC GAG CTG 144 
Leu Leu Ala Gly Gin Arg Phe Asp Leu Arg He Glu Ala Ser Glu Leu 
35 40 45 

AAA GGC AAT TTA AAG GCT TAC CGC ATC ACC CTG GAC GGC CAG CCT CTG 192 
Lys Gly Asn Leu Lys Ala Tyr Arg He Thr Leu Asp Gly Gin Pro Leu 
50 55 60 

GCG GGC CTC GAG CAA ACC GCG CAG GGG GCC GGG CAG GCC GAG TGG ACC 240 
Ala Gly Leu Glu Gin Thr Ala Gin Gly Ala Gly Gin Ala Glu Trp Thr 
65 70 75 80 

CTG CGC GGT GCC TTC CTG CGC CCT GGA AGC CAC ACC CTC GAG GTC AGC 288 
Leu Arg Gly Ala Phe Leu Arg Pro Gly Ser His Thr Leu Glu Val Ser 
85 90 95 

CTC ACC GAC GAC GCT GGG GAG AGC AGG AAG AGC GTA CGT TGG GAG GCT 336 
Leu Thr Asp Asp Ala Gly Glu Ser Arg Lys Ser Val Arg Trp Glu Ala 
100 105 110 

CGG CAG AAC CTT CGC TTG CCC CGA GCG GCC AAG AAT GTG ATT CTC TTC 384 
Arg Gin Asn Leu Arg Leu Pro Arg Ala Ala Lys Asn Val He Leu Phe 
115 120 125 

ATT GGC GAC GGG ATG GGC TGG AAC ACC CTC AAC GCC GCC CGC ATC ATC 432 
He Gly Asp Gly Met Gly Trp Asn Thr Leu Asn Ala Ala Arg He He 
130 135 140 

GCC AAA GGC TTT AAC CCC GAA AAC GGT ATG CCC AAC GGA AAC CTC GAG 480 
Ala Lys Gly Phe Asn Pro Glu Asn Gly Met Pro Asn Gly Asn Leu Glu 
145 150 155 160 

ATC GAG AGT GGT TAC GGT GGG ATG GCT ACC GTC ACT ACC GGC AGC TTT 528 
He Glu Ser Gly Tyr Gly Gly Met Ala Thr Val Thr Thr Gly Ser Phe 
165 170 175 

GAT AGC TTC ATC GCC GAC TCA GCT AAC TCG GCT TCT TCC ATC ATG ACC 576 
Asp Ser Phe He Ala Asp Ser Ala Asn Ser Ala Ser Ser He Met Thr 
180 185 190 



GGG CAG AAG 
Gly Gin Lys 
195 



GTG CAG 
Val Gin 



GTG 
Val 



AAT GCC 
Asn Ala 
200 



CTC AAC GTT 
Leu Asn Val 



TAG CCA TCA AAC CTC 
Tyr Pro Ser Asn Leu 
205 



624 



AAA GAT ACC 
Lys Asp Thr 
210 



CTG GCC 
Leu Ala 



TAC 
Tyr 



CCC CGG 
Pro Arg 
215 



ATC GAA ACC 
He Glu Thr 



CTA GCG GAG ATG CTC 
Leu Ala Glu Met Leu 
220 



672 



AAG CGG GTA 
Lys Arg Val 
225 



CGC GGG 
Arg Gly 



GCC 
Ala 
230 



AGC ATT 
Ser He 



GGG GTA GTG 
Gly Val Val 
235 



ACC ACC ACC TTC 
Thr Thr Thr Phe 



GGC 
Gly 
240 



720 



ACC GAC GCT 
Thr Asp Ala 



ACC CCG 
Thr Pro 
245 



GCT 
Ala 



TCA CTC 
Ser Leu 



AAC GCC CAT 
Asn Ala His 
250 



ACC CGC 
Thr Arg 



CGC CGC 
Arg Arg 
255 



GGT 
Gly 



768 



GAT TAC CAG 
Asp Tyr Gin 



GCT ATC 
Ala He 
260 



GCC 
Ala 



GAC ATG 
Asp Met 



TAC TTT GGT 
Tyr Phe Gly 
265 



AGA GGC 
Arg Gly 



GGG TTC 
Gly Phe 
270 



GGT 
Gly 



816 



GTT 
Val 



CCC TTG 
Pro Leu 
275 



GAT GTG 
Asp Val 



ATG 
Met 



CTC TTC 
Leu Phe 
280 



GGT GGT TCA 
Gly Gly Ser 



CGC GAC 
Arg Asp 
285 



TTC ATC 
Phe He 



CCC 
Pro 



864 



CAG 
Gin 



AGC ACC 
Ser Thr 
290 



CCT GGC 
Pro Gly 



TCG 
Ser 



CGG CGC 
Arg Arg 
295 



AAG GAT AGC 
Lys Asp Ser 



ACG GAC 
Thr Asp 
300 



TGG ATT 
Trp He 



GCC 
Ala 



912 



GAA 
Glu 
305 



TCC CAG 
Ser Gin 



AAG CTG 
Lys Leu 



GGC 
Gly 
310 



TAC ACC 
Tyr Thr 



TTT GTC AGC 
Phe Val Ser 
315 



ACC CGC 
Thr Arg 



AGC GAG 
Ser Glu 



CTG 
Leu 
320 



960 



CTG 
Leu 



GCG GCC 
Ala Ala 



AAA CCC 
Lys Pro 
325 



ACC 
Thr 



GAT AAG 
Asp Lys 



CTG TTT GGG 
Leu Phe Gly 
330 



CTG TTC AAC ATT GAC 
Leu Phe Asn He Asp 
335 



1008 



AAC TTC CCC 
Asn Phe Pro 



AGC TAC 
Ser Tyr 
340 



CTA 
Leu 



GAC CGC 
Asp Arg 



GCA GTG TGG 
Ala Val Trp 
345 



AAG CGG CCC GAG ATG 
Lys Arg Pro Glu Met 
350 



1056 



CTG GGA AGC 
Leu Gly Ser 
355 



TTT ACC 
Phe Thr 



GAT 
Asp 



ATG CCC 
Met Pro 
360 



TAC CTC TGG 
Tyr Leu Trp 



GAG ATG ACC CAG AAA 
Glu Met Thr Gin Lys 
365 



1104 



GCC GTG GAG 
Ala Val Glu 
370 



GCT CTC 
Ala Leu 



TCC 
Ser 



AGA AAC 
Arg Asn 
375 



GAC AAA GGC 
Asp Lys Gly 



TTT TTC TTG ATG GTT 
Phe Phe Leu Met Val 
380 



1152 



GAG GGG GGA 
Glu Gly Gly 
385 



ATG GTG 
Met Val 



GAT 
Asp 
390 



AAG TAC 
Lys Tyr 



GAG CAC CCC 
Glu His Pro 
395 



TTG GAC TGG CCC 
Leu Asp Trp Pro 



CGC 
Arg 
400 



1200 



GCA CTT TGG 
Ala Leu Trp 



GAT GTA 
Asp Val 
405 



CTC 
Leu 



GAG CTG 
Glu Leu 



GAC CGC GCG 
Asp Arg Ala 
410 



GTG GCT 
Val Ala 



TGG GCC 
Trp Ala 
415 



AAG 
Lys 



1248 



GGC TAT GCG 
Gly Tyr Ala 



GCC TCC 
Ala Ser 
420 



CAC 
His 



CCC GAT 
Pro Asp 



ACC CTG GTG 
Thr Leu Val 
425 



ATT GTC 
He Val 



ACC GCC 
Thr Ala 
430 



GAC 
Asp 



1296 



CAC GCT CAC 
His Ala His 
435 



TCG ATC 
Ser He 



TCG 
Ser 



GTG TTT 
Val Phe 
440 



GGC GGT TAC 
Gly Gly Tyr 



GAC TAC 
Asp Tyr 
445 



TCC AAG 
Ser Lys 



CAG 
Gin 



1344 



GGC CGG GAG GGG GTG GGG GTT TAT GAG GCC GCC AAG TTC CCC ACC TAC 



1392 



Gly Arg Glu Gly Val Gly Val Tyr Glu Ala Ala Lys Phe Pro Thr Tyr 
450 455 460 



GGC GAC AAA 
Gly Asp Lys 
465 



CGG GGC CGC 
Arg Gly Arg 



GGT TAG 
Gly Tyr 



TAG CGG CAA 
Tyr Arg Gin 
530 

CCC ATG CCG 
Pro Met Pro 
545 

CTC ATC GAC 
Leu lie Asp 



ATA TTA GAG 
lie Leu Glu 



AAA GAC GCC 
Lys Asp Ala 
470 



GAG 
Glu 
500 



CTC 
Leu 



CTG TTT GCC 
Leu Phe Ala 
550 



AAC GGC TTT 
Asn Gly Phe 



TTT GGC GTG 
Phe Gly Val 



GAG GGT AAG 
Glu Gly Lys 



GAA CAA CCG 
Glu Gin Pro 
40 



CCC TTG CCC 
Pro Leu Pro 
475 



CTT AAT GAG 
Leu Asn Glu 



GAC ACC ACT 
Asp Thr Thr 



AAA TAT ATC 
Lys Tyr lie 
45 



CGG 1440 

Arg 

480 

1488 
1536 
1584 
1632 
1680 
1728 
1755 



48 
96 

GAG 144 
Glu 

192 



TTC AAC CCC CAC CTC GAG AAG CCT TAA 
Phe Asn Pro His Leu Glu Lys Pro 
580 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 906 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...903 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 



ATG TAT AAA TGG ATT ATT 
Met Tyr Lys Trp He He 
1 5 

AGC CTA GGT GAA CTA GCC GAT CTC AAA AGA CTT TTC GAC GCC ATT ATT 
Ser Leu Gly Glu Leu Ala Asp Leu Lys Arg Leu Phe Asp Ala He He 
20 25 30 



GTT CTT ACA ATG CCG CAT 
Val Leu Thr Met Pro His 
35 



AGC CAT GGA TTC CAA GTC CTC CAT GTC CCC ACG CTC GAC 
Ser His Gly Phe Gin Val Leu His Val Pro Thr Leu Asp 



GGA ATC GCG GTA GGC TTC GGG GCC ACG CCG GAT TAC TGT GAA ACC TAG 

Gly He Ala Val Gly Phe Gly Ala Thr Pro Asp Tyr Cys Glu Thr Tyr 
485 490 495 

GTC TAC AAA GAC CCC ACC ATC TCC GAC GGC AAA GGT 

Val Tyr Lys Asp Pro Thr He Ser Asp Gly Lys Gly 

505 510 

GTG GCC AAC CCT GAG GTC TGC AAG GAG CCG. GGC CTT CCA ACG 

Val Ala Asn Pro Glu Val Cys Lys Glu Pro Gly Leu Pro Thr 
515 520 525 



CCA GTA GAT AGC GCC CAG GGC GTG CAC ACG GCT GAT 
Pro Val Asp Ser Ala Gin Gly Val His Thr Ala Asp 
535 540 



GGG TCT CAG TTC TTC AAT GGC 
Gly Ser Gin Phe Phe Asn Gly 
555 560 

CAG ACC GAG ATC TTC TTC CGC ATG GCC CAG GCC CTA GGG 
Gin Thr Glu He Phe Phe Arg Met Ala Gin Ala Leu Gly 
565 570 575 



NO:25 : 



A 



ce 



• ID NO: 25: 

CTT GCC CAA GCA CCT TTT CCA 

Leu Ala Gin Ala Pro Phe Pro 

10 15 



50 55 60 

TTT CAT CCT TTA GAA CTC TTC GAC CTT TTG AAA ACA AGC ATA TTC ATT 24 0 

Phe His Pro Leu Glu Leu Phe Asp Leu Leu Lys Thr Ser lie Phe lie 
65 70 75 80 

GAT GAA AAC CTG GAG AGA TCC CAC AGA GTG CTT GTC CAC TGC ATG GGA 2 88 

Asp Glu Asn Leu Glu Arg Ser His Arg Val Leu Val His Cys Met Gly 
85 90 95 

GGC ATA GGC CGG AGC GGG CTT GTA ACT GCT GCG TAG TTA ATA TTC AAA 336 
Gly He Gly Arg Ser Gly Leu Val Thr Ala Ala Tyr Leu He Phe Lys 
100 105 110 

GGT TAT GAT ATT TAC GAC GCG GTA AAG CAT GTG AGA ACG GTA GTG CCT 384 
Gly Tyr Asp lie Tyr Asp Ala Val Lys His Val Arg Thr Val Val Pro 
115 120 125 

GGT GCT ATT GAA AAC AGA GGG CAA GCG TTA ATG CTT GAG AAC TAC TAT 432 
Gly Ala He Glu Asn Arg Gly Gin Ala Leu Met Leu Glu Asn Tyr Tyr 
130 135 140 

ACC CTG GTC AAA AGT TTC AAC AGA GAG TTG CTG AGA GAC TAC GGG AAG 480 
Thr Leu Val Lys Ser Phe Asn Arg Glu Leu Leu Arg Asp Tyr Gly Lys 
145 150 155 160 

AAA ATT TTC ACG CTC GGT GAC CCG AAG GCG GTT CTC CAC GCT TCT AAG 528 
Lys lie Phe Thr Leu Gly Asp Pro Lys Ala Val Leu His Ala Ser Lys 
165 170 175 

ACG ACT CAG TTC ACG ATT GAA CTC TTA AGC AAC TTA CAC GTC AAC GAG 576 
Thr Thr Gin Phe Thr He Glu Leu Leu Ser Asn Leu His Val Asn Glu 
180 185 190 

GCG TTT TCA ATC AGT GCG ATG GCT CAA TCA CTG CTC CAC TTT CAC GAC 624 
Ala Phe Ser He Ser Ala Met Ala Gin Ser Leu Leu His Phe His Asp 
195 200 205 

GTA AAA GTC CGC TCT AAA CTG AAA GAA GTA TTC GAA AAC ATG GAA TTC 672 
Val Lys Val Arg Ser Lys Leu Lys Glu Val Phe Glu Asn Met Glu Phe 
210 215 220 

TCA TCC GCC TCA GAG GAG GTT CTG TCA TTT ATT CAC CTA CTC GAT TTC 720 
Ser Ser Ala Ser Glu Glu Val Leu Ser Phe He His Leu Leu Asp Phe 
225 230 235 240 

TAT CAG GAT GGC AGG GTT GTT TTA ACC ATT TAC GAT TAT CTC CCC GAT 768 
Tyr Gin Asp Gly Arg Val Val Leu Thr He Tyr Asp Tyr Leu Pro Asp 
245 250 255 

AGG GTG GAT TTG ATT TTA TTG TGT AAG TGG GGT TGT GAT AAA ATA GTT 816 
Arg Val Asp Leu He Leu Leu Cys Lys Trp Gly Cys Asp Lys He Val 
260 265 270 

GAA GTC TCG TCT TCA GCG AAG AAA ACC GTT GAG AAG CTT GTA GGA AGA 864 
Glu Val Ser Ser Ser Ala Lys Lys Thr Val Glu Lys Leu Val Gly Arg 
275 280 285 

AAG GTT TCC CTA TCC TGG GCT AAT TAC TTA GAC TAT GTT TAG 906 
Lys Val Ser Leu Ser Trp Ala Asn Tyr Leu Asp Tyr Val 
290 295 300 

(2) INFORMATION FOR SEQ ID NO: 26: 



(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 774 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1. . .771 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

ATG AGA ATC CTC CTC ACC AAC GAC GAC GGC ATC TAT TCC AAC GGT CTG 4 8 

Met Arg lie Leu Leu Thr Asn Asp Asp Gly lie Tyr Ser Asn Gly Leu 
15 10 15 

CGC GCG GCG GTG AAG GGC CTG AGC GAG CTC GGC GAG GTC TAC GTC GTC 96 
Arg Ala Ala Val Lys Gly Leu Ser Glu Leu Gly Glu Val Tyr Val Val 
20 25 30 

GCC CCG CTC TTC CAG AGG AGC GCG AGC GGT CGG GCG ATG ACC CTA CAC 144 
Ala Pro Leu Phe Gin Arg Ser Ala Ser Gly Arg Ala Met Thr Leu His 
35 40 45 

AGG CCG ATA AGG GCA AAG AGG GTT GAC GTT CCC GGC GCG AAG ATA GCG 192 
Arg Pro lie Arg Ala Lys Arg Val Asp Val Pro Gly Ala Lys lie Ala 
50 55 60 

TAT GGC ATA GAC GGA ACG CCG ACC GAC TGC GTG ATT TTT GCC ATC GCC 240 
Tyr Gly lie Asp Gly Thr Pro Thr Asp Cys Val lie Phe Ala lie Ala 
65 70 75 80 

CGC TTC GGC GAC TTT GAT CTG GCG GTC AGC GGG ATA AAC CTA GGC GAG 288 
Arg Phe Gly Asp Phe Asp Leu Ala Val Ser Gly lie Asn Leu Gly Glu 
85 90 95 

AAC CTG AGC ACG GAG ATA ACC GTC TCC GGA ACG GCC TCG GCG GCG ATA 336 
Asn Leu Ser Thr Glu He Thr Val Ser Gly Thr Ala Ser Ala Ala He 
100 105 110 

GAG GCT TCC ACC CAC GGG ATT CCA AGT GTA GCT ATA AGC CTC GAG GTC 384 
Glu Ala Ser Thr His Gly He Pro Ser Val Ala He Ser Leu Glu Val 
115 120 125 

GAG TGG AAG AAG ACC CTC GGC GAG GGG GAG GGT ATT GAC TTC TCG GTT 432 
Glu Trp Lys Lys Thr Leu Gly Glu Gly Glu Gly He Asp Phe Ser Val 
130 135 140 

TCA GCA CAC TTC CTG AGA AGG ATA GCG ACG GCT GTC CTT AAG AAG GGC 480 
Ser Ala His Phe Leu Arg Arg He Ala Thr Ala Val Leu Lys Lys Gly 
145 150 155 160 

CTG CCT GAA GGG GTG GAC ATG CTC AAC GTG AAC GTC CCT AGC GAC GCC 52 8 

Leu Pro Glu Gly Val Asp Met Leu Asn Val Asn Val Pro Ser Asp Ala 
165 170 175 

AGC GAG GGG ACT GAG ATC GCC ATA ACG CGC CTC GCG AGG AAG CGC TAT 576 
Ser Glu Gly Thr Glu He Ala He Thr Arg Leu Ala Arg Lys Arg Tyr 
180 185 190 



TCT CCG ACG ATA GAG GAG AGG ATA GAC CCC AAG GGC AAC CCC TAC TAC 
Ser Pro Thr He Glu Glu Arg He Asp Pro Lys Gly Asn Pro Tyr Tyr 
195 200 205 



624 



TGG ATC GTT GGC AGO CTC GTC CAG GAG TTC GAG CCG GGC ACG GAG GCC 
Trp lie Val Gly Arg Leu Val Gin Glu Phe Glu Pro Gly Thr Asp Ala 
210 215 220 



672 



TAG GOT CTG AAA GTC GAG AGA AAG GTC AGC GTC ACG CCC ATA AAC ATC 
Tyr Ala Leu Lys Val Glu Arg Lys Val Ser Val Thr Pro He Asn He 
225 230 235 240 



720 



GAC ATG ACT GCG AGG GTT GAC TTT GAG AAC CTT CAA AGG CTT CTG AGC 
Asp Met Thr Ala Arg Val Asp Phe Glu Asn Leu Gin Arg Leu Leu Ser 
245 250 255 



768 



CTG TGA 
Leu 



774 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 795 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...7 92 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

ATG GAA AAC TTA AAA AAG TAG CTA GAA GTT GCA AAA ATA GCC GCG CTC 48 
Met Glu Asn Leu Lys Lys Tyr Leu Glu Val Ala Lys He Ala Ala Leu 
15 10 15 

GCG GGT GGG CAG GTT CTG AAA GAA AAC TTC GGA AAG GTA AAA AAG GAA 96 
Ala Gly Gly Gin Val Leu Lys Glu Asn Phe Gly Lys Val Lys Lys Glu 
20 25 30 

AAC ATA GAG GAA AAA GGG GAA AAG GAC TTT GTA AGT TAG GTG GAT AAA 144 
Asn He Glu Glu Lys Gly Glu Lys Asp Phe Val Ser Tyr Val Asp Lys 
35 40 45 

ACT TCA GAG GAA AGG ATA AAG GAG GTG ATA CTC AAG TTC TTT CCC GAT 192 
Thr Ser Glu Glu Arg He Lys Glu Val He Leu Lys Phe Phe Pro Asp 
50 55 60 

CAC GAG GTC GTA GGG GAA GAG ATG GGT GCG GAG GGA AGC GGA AGC GAA 240 
His Glu Val Val Gly Glu Glu Met Gly Ala Glu Gly Ser Gly Ser Glu 
65 70 75 80 

. TAG AGG TGG TTC ATA GAC CCC CTT GAC GGC ACA AAG AAC TAG ATA AAC 288 
Tyr Arg Trp Phe He Asp Pro Leu Asp Gly Thr Lys Asn Tyr He Asn 
85 90 95 

GGT TTT CCC ATC TTT GCC GTA TCA GTG GGA CTT GTT AAG GGA GAA GAG 336 
Gly Phe Pro He Phe Ala Val Ser Val Gly Leu Val Lys Gly Glu Glu 
100 105 110 

CCA ATT GTG GGT GCG GTT TAG CTT CCT TAG TTT GAC AAG CTT TAG TGG 3 84 

Pro He Val Gly Ala Val Tyr Leu Pro Tyr Phe Asp Lys Leu Tyr Trp 
115 120 125 



GGT GGT AAA GGT CTC GGG GCT TAG GTA AAC GGA AAG AGG ATA AAG GTA 



432 



Gly Ala Lys Gly Leu Gly Ala Tyr Val Asn Gly Lys Arg lie Lys Val 
130 135 140 



AAG GAC AAT GAG AGT TTA AAG CAC GCC GGA GTG GTT TAG GGA TTT CCC 480 
Lys Asp Asn Glu Ser Leu Lys His Ala Gly Val Val Tyr Gly Phe Pro 
145 150 155 160 

TCT AGG AGC AGG AGG GAC ATA TCT ATC TAG TTG AAC ATA TTC AAG GAT 52 8 

Ser Arg Ser Arg Arg Asp He Ser He Tyr Leu Asn He Phe Lys Asp 
165 170 175 

GTC TTT TAG GAA GTT GGC TCT ATG AGG AGA CCC GGG GCT GCT GCG GTT 576 
Val Phe Tyr Glu Val Gly Ser Met Arg Arg Pro Gly Ala Ala Ala Val 
180 185 190 



GAC CTC TGC ATG GTG GCG GAA GGG ATA TTT GAC GGG ATG ATG GAG TTT 624 

Asp Leu Cys Met Val Ala Glu Gly He Phe Asp Gly Met Met Glu Phe 
195 200 205 

GAA ATG AAG CCG TGG GAC ATA ACC GCA GGG CTT GTA ATA CTG AAG GAA 672 

Glu Met Lys Pro Trp Asp He Thr Ala Gly Leu Val He Leu Lys Glu 
210 215 220 

GCC GGG GGC GTT TAC ACA CTT GTG GGA GAA CCC TTC GGA GTT TCG GAC 720 

Ala Gly Gly Val Tyr Thr Leu Val Gly Glu Pro Phe Gly Val Ser Asp 

225 230 235 240 

ATA ATT GCG GGC AAC AAA GCC CTC CAC GAC TTT ATA CTT CAG GTA GCC 768 

He He Ala Gly Asn Lys Ala Leu His Asp Phe He Leu Gin Val Ala 
245 250 255 



AAA AAG TAT ATG GAA GTG GCG GTG TGA 795 
Lys Lys Tyr Met Glu Val Ala Val 
260 

(2) INFORMATION FOR SEQ ID NO: 28: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 260 amino acids 

(B) TYPE: amino acid 
• (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 





(xi) SEQUENCE 


DESCRIPTION: 


: SEQ ID 


NO:28: 






Met 


Arg 


Gly 


Ser 


Gly 


Val 


Arg 


He 


Leu 


Leu 


Thr 


Asn 


Asp Asp Gly 


He 


1 




5 










10 






15 




Phe 


Ala 


Glu 


Gly 


Leu 


Gly 


Ala 


Leu 


Arg 


Lys 


Met 


Leu 


Glu Pro Val 


Ala 








20 








25 








30 




Thr 


Leu 


Tyr 


Val 


Val 


Ala 


Pro 


Asp 


Arg 


Glu 


Arg 


Ser 


Ala Ala Ser 


His 






35 










40 










45 




Ala 


He 


Thr 


Val 


His 


Arg 


Pro 


Leu 


Arg 


Val 


Arg 


Glu 


Ala Gly Phe 


Arg 




50 








55 










60 






Ser 


Pro 


Arg 


Leu 


Lys 


Gly 


Trp 


Val 


val 


Asp 


Gly 


Thr 


Pro Ala Asp 


Cys 


65 






70 










75 






80 


Val 


Lys 


Leu 


Gly 


Leu 


Glu 


val 


Leu 


Leu 


Pro 


Glu 


Arg 


Pro Asp Phe 


Leu 






85 










90 






95 




Val 


Ser 


Gly 


He 


Asn 


Tyr 


Gly 


Pro 


Asn 


Leu 


Gly 


Thr 


Asp Val Leu 


Tyr 






100 








105 








110 




Ser 


Gly 


Thr 


Val 


Ser 


Ala 


Ala 


He 


Glu 


Gly 


Val 


He 


Asn Gly He 


Pro 




115 










120 










125 




Ser 


Val 


Ala 


Val 


Ser 


Leu 


Ala 


Thr 


Arg 


Arg 


Glu 


Pro 


Asp Tyr Thr 


Trp 





130 










135 




Ala 


Ala 


Arg 


Phe 


Val 


Leu 


Val 


Leu 


145 










150 






Leu 


Pro 


Pro 


Gly 


Thr 
165 


Leu 


Leu 


Asn 


Arg 


Gly 


Val 


Lys 
180 


Val 


Thr 


Lys 


Leu 


Val 


Asp 


Cys 
195 


Arg 


Thr 


Asp 


Pro 


Arg 
200 


Gly 


Glu 
210 


Pro 


Leu 


Glu 


Leu 


Asp 
215 


Gly 


Ala 


Val 


Arg 


Glu 


Gly 


Tyr 


He 


Ser 


225 










230 






Thr 


Asn 


Tyr 


Gly 


Phe 


Leu 


Glu 


Glu 








245 








He 


Phe 


Ser 


Ser 
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(2) INFORMATION FOR 









140 










Leu 


Glu 


Glu 


Leu 


Arg 


Lys 


His 


Gin 






155 










160 


Val 


Asn 


Val 


Pro 


Asp 


Gly Val 


Pro 




170 










175 




Gly 


Ser 


Val 


Arg 


Tyr 


Val 


Asn 


Val 


185 










190 






Gly 


Lys 


Ala 


Tyr 


Tyr 


Trp 


Met 


Ala 










205 








Asn 


Asp 


Ser 


Glu 


Thr 


Asp 


Val 


Trp 








220 










Val 


Thr 


Pro 


Val 


Gin 


He 


Asp 


Leu 






235 










240 


Leu 


Lys 


Lys 


Trp 


Arg 


Phe 


Lys 


Asp 




250 










255 





ID NO: 29: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 265 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 



Met 


Leu 


Asp 


He 


Leu 


Leu 


Val 


Asn 


Asp 


Asp 


Gly 


He 


Tyr 


Ser 


Asn 


Gly 


1 






5 










10 










15 




Leu 


He 


Ala 


Leu 


Lys 


Asp 


Ala 


Leu 


Leu 


Glu 


Lys 


Phe 


Asn 


Ala 


Arg 


He 








20 








25 










30 






Thr 


He 


Val 


Ala 


Pro 


Thr 


Asn 


Gin 


Gin 


Ser 


Gly 


He Gly 


Arg Ala 


He 






35 










40 










45 








Ser 


Leu 
50 


Phe 


Glu 


Pro 


Leu 


Arg 
55 


He 


Thr 


Lys 


Thr 


Lys 
60 


Leu 


Ala 


Asp 


Gly 


Ser 


Trp 


Gly 


Tyr 


Ala 


Val 


Ser 


Gly 


Thr 


Pro 


Thr 


Asp 


Cys 


Val 


He 


Leu 


65 






70 










75 










80 


Gly 


He 


Tyr 


Glu 


He 


Leu 


Lys 


Lys 


Val 


Pro 


Asp 


Val 


Val 


He 


Ser 


Gly 






85 










90 










95 




He 


Asn 


He 


Gly 


Glu 


Asn 


Leu 


Gly 


Thr 


Glu 


He 


Thr 


Thr 


Ser Gly 


Thr 








100 










105 










110 






Leu 


Gly 


Ala 


Ala 


Phe 


Glu 


Gly 


Ala 


His 


His 


Gly 


Ala 


Lys 


Ala 


Leu 


Ala 




115 










120 










125 








Ser 


Ser 


Leu 


Gin 


Val 


Thr 


Ser 


Asp 


His 


Leu 


Lys 


Phe 


Lys 


Glu Gly 


Glu 




130 










135 










140 










Thr 


Pro 


He 


Asp 


Phe 


Thr 


Val 


Pro 


Ala 


Arg 


He 


Thr 


Ala 


Asn 


Val 


Val 


145 








150 










155 










160 


Glu 


Lys 


Met 


Leu 


Asp 


Tyr 


Asp 


Phe 


Pro 


Cys 


Asp 


Val 


Val 


Asn 


Leu 


Asn 








165 










170 










175 




He 


Pro 


Glu 


Gly 
180 


Ala 


Thr 


Glu 


Lys 


Thr 
185 


Pro 


He 


Glu 


He 


Thr 
190 


Arg 


Leu 


Ala 


Arg 


Lys 


Met 


Tyr 


Thr 


Thr 


His 


Val 


Glu 


Glu 


Arg 


He 


Asp 


Pro 


Arg 




195 










200 










205 








Gly 


Arg 


Ser 


Tyr 


Tyr 


Trp 


He 


Asp 


Gly 


Tyr 


Pro 


He 


Leu 


Glu 


Glu 


Glu 


210 










215 










220 










Glu 


Asp 


Thr 


Asp 


Val 


Tyr 


Val 


Val 


Arg 


Arg 


Lys 


Gly His 


He 


Ser 


Leu 


225 








230 










235 










240 


Thr 


Pro 


Leu 


Thr 


Leu 


Asp 


Thr 


Thr 


He 


Lys 


Asn 


Leu 


Glu 


Glu 


Phe 


Lys 










245 








250 










255 




Lys 


Lys 


Tyr 


Glu 


Arg 


He 


Leu 


Asn 


Glu 

















260 265 



(2) INFORMATION FOR SEQ ID NO: 30: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 254 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



Met 


Met 


Met 


Glu 


Phe 


Thr 


Arg 


Glu 


1 

Leu 


Gin 


Gly 


Leu 


5 

Gly 


Glu 


He 


Tyr 








20 










Ser 


Ala 


Ser 


Gly 


Arg 


Ala 


Met 


Thr 






35 










40 


Arg 


He 


Ser 


Met 


Asn 


Gly 


Ala 


Lys 




50 










55 




Pro 


Val 


Asp 


Cys 


Val 


He 


Phe 


Ala 


65 










70 






Leu 


Ala 


He 


Ser 


Gly 


Val 


Asn 


Leu 










85 








Thr 


Val 


Ser 


Gly 


Thr 


Ala 


Ser 


Ala 








100 










He 


Pro 


Ser 


He 


Pro 


He 


Ser 


Leu 






115 










120 


Phe 


Gly 


Glu 


Gly 


Glu 


Glu 


He 


Asp 




130 










135 




Arg 


Lys 


He 


Ala 


Thr 


Ala 


Val 


Leu 


145 










150 






Asp 


Met 


Leii 


Asn 


Val 


Asn 


Val 


Pro 








165 








He 


Ala 


Phe 


Thr 


Arg 


Leu 


Ala 


Arg 








180 










Glu 


Arg 


He 


Asp 


Pro 


Lys 


Gly 


Asn 






195 










200 


Gin 


Cys 


Pro 


Lys 


Glu 


Ala 


Leu 


Glu 




210 










215 




Lys 


Val 


Glu 


Arg 


Lys 


Val 


Ser 


Val 


225 










230 






Ala 


Arg 


Val 


Asn 


Leu 


Asp 


Glu 


He 



245 



Gly 


He 


Lys 


Ala 


Ala 


Val 


Glu 


Ala 




10 










15 




Val 


Val 


Ala 


Pro 


Met 


Phe 


Gin 


Arg 


25 










30 






He 


His 


Arg 


Pro 


Leu 
45 


Arg 


Ala 


Lys 


Ala 


Ala 


Tyr 


Ala 


Leu 


Asp 


Gly 


Met 








60 










Met 


Ala 


Arg 


Phe 


Gly 


Asp 


Phe 


Asp 






75 










80 


Gly 


Glu 


Asn 


Met 


Ser 


Thr 


Glu 


He 


90 










95 




Ala 


He 


Glu 


Ala 


Ala 


Thr 


Gin 


Glu 


105 










110 






Glu 


Val 


Asn 


Arg 


Glu 


Lys 


His 


Lys 










125 








Phe 


Ser 


Ala 


Ala 


Lys 


Tyr 


Phe 


Leu 








140 










Lys 


Arg 


Gly 


Leu 


Pro 


Lys 


Gly 


Val 






155 










160 


Tyr 


Asp 


Ala 


Asn 


Glu 


Arg 


Thr 


Glu 




170 










175 




Arg 


Met 


Tyr 


Arg 


Pro 


Ser 


He 


Glu 


185 










190 






Pro 


Tyr 


Tyr 


Trp 


He 


Val 


Gly 


Thr 










205 








Pro 


Gly 


Thr 


Asp 


Met 


Tyr 


Val 


Val 








220 










Thr 


Pro 


He 


Asn 


He 


Asp 


Met 


Thr 






235 










240 


Lys 


Arg 


Leu 


Leu 


Glu 


Leu 







250 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 271 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Met Arg Thr Leu Thr He Asn Thr Asp Ala Glu Gly Phe Val Leu Arg 

15 10 15 

He Leu Leu Thr Asn Asp Asp Gly He Tyr Ser Asn Gly Leu Arg Ala 

20 25 30 

Ala Val Lys Ala Leu Ser Glu Leu Gly Glu Val Tyr Val Val Ala Pro 



35 40 45 



Leu 


Phe 


Gin 


Ara 


Ser 


Ala 


Ser 


Gly 


Aro 


Ala 


Met 


Thr 


Leu His 


Arg Pro 




50 










55 










60 






lie 


Arcr 


Ala 


Lys 


Ara 


Val 


ASD 


Val 


Pro 


Gly 


Ala 


Lys 


He Ala 


Tyr Gly 


65 










70 










75 






80 


He 


Asp 


Gly 


Thr 


Pro 


Thr 


Asp 


Cys 


Val 


He 


Phe 


Ala 


He Ala 


Arg Phe 








85 










90 








95 


Gly 


Ser 


Phe 


Gly 


Leu 


Ala 


Val 


Ser 


Gly 


He 


Asn 


Leu 


Gly Glu 


Asn Leu 






100 










105 








110 




Ser 


Thr 


Glu 


He 


Thr 


Val 


Ser 


Gly 


Thr 


Ala 


Ser 


Ala 


Ala He 


Glu Ala 






115 










120 










125 




Ser 


Thr 


His 


Gly 


He 


Pro 


Ser 


He 


Ala 


He 


Ser 


Leu 


Glu Val 


Glu Trp 




130 








135 










14 0 






Lvs 


Lys 


Thr 


Leu 


Gly 


Glu 


Gly 


Glu 


Gly 


Val 


Asp 


Phe 


Ser Val 


Ser Thr 


145 








150 










155 






160 


His 


Phe 


Leu 


Lvs 


Arcr 


He 


Ala 


Gly 


Ala 


Leu 


Leu 


Glu 


Arg Gly 


Leu Pro 








165 










170 








175 


Glu 


Gly 


Val 


ASD 


Met 


Leu 


Asn 


Val 


Asn 


Val 


Pro 


Ser 


Asp Ala 


Thr Glu 






180 










185 








190 




Glu 


Thr 


Glu 


He 


Ala 


He 


Thr 


Arg 


Leu 


Ala 


Arg 


Lys 


Arg Tyr 


Ser Pro 






195 










200 










205 




Thr 


Val 


Glu 


Glu 


Arg 


He 


Asp 


Pro 


Lys 


Gly 


Asn 


Pro 


Tyr Tyr 


Trp He 




210 










215 










220 






Val 


Gly 


Lys 


Leu 


Val 


Gin 


Asp 


Phe 


Glu 


Pro 


Gly Thr 


Asp Ala 


Tyr Ala 


225 






230 










235 






240 


Leu 


Lys 


Val 


Glu 


Arg 


Lys 


Val 


Ser 


Val 


Thr 


Pro 


He 


Asn He 


Asp Met 








245 










250 








255 


Thr 


Ala 


Arg 


Val 


Asp 


Phe 


Glu 


Glu 


Leu 


Val 


Arg Val 


Leu Trp 


Val 






260 








265 








270 








(2) 


INFORMATION FOR SEQ ID 


NO: 32: 










(i) SEQUENCE CHARACTERISTICS: 
















(A) 


LENGTH : 


497 


amino acids 
















(B) 


TYPE : amino 


acid 


















(D) 


TOPOLOGY: linear 
















(ii) MOLECULE 


TYPE: protein 














(v) FRAGMENT TYPE: 


: internal 














(xi) SEQUENCE 


DESCRIPTION: 


: SEQ ID 


NO: 32: 






Met 


Lys 


Gly 


Lys 


Ser 


Leu 


Val 


Ser 


Gly 


Leu 


Leu 


Leu 


Gly Leu 


Leu He 


1 


5 










10 








15 


Leu 


Ser 


Leu 


He 


Ser 


Phe 


Gin 


Pro 


Ser 


Phe 


Ala 


Tyr 


Ser Pro 


His Gly 








20 










25 








30 




Gly 


Val 


Lys 


Asn 


He 


He 


He 


Leu 


Val 


Gly 


Asp 


Gly 


Met Gly 


Leu Gly 




35 










40 










45 




His 


Val 


Glu 


He 


Thr 


Lys 


Leu 


Val 


Tyr 


Gly 


His 


Leu 


Asn Met 


Glu Asn 




50 








55 










60 






Phe 


Pro 


Val 


Thr 


Gly 


Phe 


Glu 


Leu 


Thr 


Asp 


Ser 


Leu 


Ser Gly 


Glu Val 










70 










75 






80 


Thr 


Asp 


Ser 


Ala 


Ala 


Ala 


Gly 


Thr 


Ala 


He 


Ser 


Thr 


Gly Ala 


Lys Thr 








85 








90 








95 


lyi: 


Asn 


Gly 


Met 


He 


Ser 


Val 


Thr 


Asn 


He 


Thr 


Gly 


Lys He 


Val Asn 




100 










105 








110 




Leu 


Thr 


Thr 


Leu 


Leu 


Glu 


Val 


Ala 


Gin 


Glu 


Leu 


Gly 


Lys Ser 


Thr Gly 






115 










120 










125 




Leu 


Val 


Thr 


Thr 


Thr 


Arg 


He 


Thr 


His 


Ala 


Thr 


Pro 


Ala Val 


Phe Ala 




130 








135 










140 






Ser 


His 


Val 


Pro 


Asp 


Arg 


Asp 


Met 


Glu 


Gly 


Glu 


He 


Pro Lys 


Gin Leu 


145 








150 










155 






160 


He 


Met 


His 


Lys 


val 


Asn 


Val 


Leu 


Leu 


Gly 


Gly 


Gly 


Arg Glu 


Lys Phe 








165 










170 








175 



Asp 


Glu 


Lvs 


Asn 


Leu 


Glu 


Leu 


Ala 








180 










Phe 


Thr 


Lvs 


Glu 


Glu 


Leu 


Glu 


Lvs 






195 










200 


Leu 


Phe 


Ala 


Glu 


Ser 


His 


He 


Pro 




210 










215 




ASD 


Val 


Gly 


Leu 


Leu 


Glu 


Met 


Ala 


225 










230 






Lvs 


Asn 


Pro 


Ser 


Gly 


Phe 


Phe 


Leu 










245 








His 


Ala 


Ala 


His 


Gly 


Asn 


Asp 


Val 








260 










Glu 


Phe 


ASD 


ASD 


Val 


Val 


Arcr 


Tyr 






275 










280 


Glv 


ASD 


Thr 


Leu 


Val 


He 


Val 


Leu 




290 










295 




Ala 


lie 


Glv 

w J. jr 


Leu 


Thr 


Tvr 


Gly 


Asn 


305 










310 






Lys 


lie 


Lys 


Ala 


Ser 


Thr 


Leu 


Arcr 










325 








Ser 


Ser 


Val 


Lvs 


Glu 


Ser 


Ser 


Lys 








340 










Glv 


Aro 


Ser 


Gin 


Tvr 


He 


Glu 


Asn 






355 










360 


Ala 


Leu 


Ser 


Asn 


Ala 


Val 


Ala 


Asp 




370 










375 




Glv 


Phe 


Thr 


Ser 


Tvr 


Glu 


His 


Thr 


385 










390 








Glv 


Pro 


Glv 


Ala 


Glu 


Asn 


Phe 










405 








Thr 


Ala 


ArQ 


Leu 


Val 


Ala 


Lys 


Leu 








420 










Pro 


Val 


Thr 


He 


Ser 


Ser 


Val 


Ser 






435 










440 


Asp 


Tyr 


Arg 


Val 


Asp 


Glu 


Lys 


Asp 




450 










455 




Leu 


Gly 


Glu 


Lys 


Val 


Asp 


Asn 


Glu 


465 










470 






Asn 


Asn 


Gly 


Met 


Val 


Asp 


Leu 


Asn 



485 



Ala 



Lvs 


Lvs 


Gin Gly Tyr 


Lvs 


Val 


Val 


185 






190 






Val 


Glu 


Gly Asp Tyr 


Val 


Leu 


Gly 






205 








Tyr 


Val 


Leu Asp Arg 


Lys 


Pro 


Asp 






220 








Lys 


Lys 


Ala He Ser 


He 


Leu 


Glu 






235 






240 


Met 


Val 


Glu Gly Gly 


Arg 


He 


Asp 




250 






255 




Ala 


Ser 


Val Val Ala 


Glu 


Thr 


Lys 


265 






270 






Val 


Leu 


Glu Tyr Pro 


Lys 


Lys 


Arg 






285 








Ala 


Asp 


His Glu Thr 


Gly Gly 


Leu 






300 








Ala 


He 


Asp Glu Asp 


Ala 


He 


Arg 






315 






320 


Met 


Pro 


Lys Glu Val 


Lys 


Ala 


Gly 




330 






335 




Val 


Cys 


Arg He Cys 


Pro 


Asn 


Arg 


345 






350 






Ala 


Leu 


His Ser Thr 


Asn 


Lys 


Tyr 






365 








Val 


He 


Asn Arg Arg 


He Gly 


Val 






380 








Gly 


Val 


Pro Val Pro 


Leu 


Leu 


Ala 




395 






400 


Ara 


Gly 


Phe Leu His 


His 


Val 


Asp 




410 






415 




Met 


Leu 


Phe Gly Arg 


Arg Asn 


He 


425 






430 






Ser 


Val 


Lys Gly Asp 


He 


Thr 


Gly 






445 








Ala 


Tyr 


Val Thr Leu 


Met 


Met 


Phe 




460 








He 


Glu 


Lys Arg Val 


Asp 


He 


Asp 






475 






480 


Asp 


Val 


Met Leu He 


Leu 


Gin 


Glu 


490 






495 





(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 584 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

Met Pro Arg Asn He Ala Ala Val Cys Ala Leu Ala Ala Leu Leu Gly 

15 10 15 

Ser Ala Trp Ala Ala Lys Val Ala Val Tyr Pro Tyr Asp Gly Ala Ala 

20 25 30 

Leu Leu Ala Gly Gin Arg Phe Asp Leu Arg He Glu Ala Ser Glu Leu 

35 40 45 

Lys Gly Asn Leu Lys Ala Tyr Arg He Thr Leu Asp Gly Gin Pro Leu 

50 55 60 

Ala Gly Leu Glu Gin Thr Ala Gin Gly Ala Gly Gin Ala Glu Trp Thr 











Leu 


Arg 


o j.y 


ai a 


Leu 




a cn 


a 








100 


Arg 


Gin 


Asn 


Leu 






115 




T 1 o 

1 xe 




Asp 


wxy 




1 "10 

X J v 






A J. a 


Lys 


y 


irllc 


145 








J. ±6 


ox Li 


OCX 


ux y 


_ 


OCX 


IrllC 


lie 








180 


Gly 


Gin 


Lys 


Val 






195 




ijy s 




1 IIX 


XIC u 




^ X V 






Lys 


Arg 


Vax 


a irrt 

Arg 


^ Z 3 










A an 


Ala 


Thr 

XXIX 


ASp 


iyr 


m n 


Al a 
nxa 








260 


Val 


Pro 


Leu 


Asp 






^ / 3 




oJ.Il 


ser 


lllx 


fx t? 




290 








Ser 




T Aze 

jjy D 


7 n ^ 

J VJ 3 








Leu 


AX a 


al a 


_ 

Lys 


A.SI1 


file 


It X \J 


OCX 








340 


Leu 


Gly 


Ser 


Phe 






J ^ ^ 






VaX 


VjXU 


al a 

nXa 




O f \J 










vjxy 


i*lC L» 


385 








Ala 


XICU 




Asp 


variy 


lyr 


al a 


Ala 








420 


His 


Ala 


His 


Ser 






'X J ^ 






Arg 


r*! 11 
vjXU 


uxy 




*S 3 V 






vjxy 


Asp 


Lys 


ijy D 


1 Q 3 










X XG 


al a 


Val 

V CLX 


Arg 


1 ^7 
vjxy 


Arg 


m 11 

ox u 








500 


Gly 


Tyr 


Val 


Ala 






3X3 




Tyr 


Arg 


(jxn 


Leu 




530 






Pro 


Met 


Pro 


Leu 


545 








Leu 


He 


Asp 


Gin 


Phe 


Asn 


Pro 


His 








580 



70 



Phe 


Leu 


Arcf 
rtxy 


Pro 


85 








Ala 


Gly 


Glu 


Ser 


Arg 


Leu 


Pro 


Arg 








120 


Met 


Glv 
\jxy 


AXp 


Asn 






135 




Asn 


Pro 


Glu 


Asn 




150 






Tvr 


Glv 


Gly 


Met 


165 








Ala 


Asp 


Ser 


Ala 


Gin 


Val 


Asn 


Ala 








200 


Ala 


Tvr 

j.y X 


Pro 


Ara 






215 




Gly 


Ala 


Ser 


He 




230 






Pro 


Ala 


Ser 


Leu 


245 








He 


Ala 


Asp 


Met 


Val 


Met 


Leu 


Phe 








280 


Glv 

wx/ 


Ser 


Ara 


Ara 






295 




Leu 


Glv 


Tvr 


Thr 




310 






Pro 


Thr 


Asp 


Lys 


325 








TVT" 
J./X 


Leu 


Asp 


Ara 


Thr 


Asp 


Met 


Pro 








360 


Leu 


Ser 


Ara 


Asn 






375 




Val 


ASD 


Lys 


Tyr 




390 






Val 


Leu 


Glu 


Leu 


405 








Ser 


His 


Pro 


Asp 


He 


Ser 


Val 


Phe 








440 


Val 
vox 


nlv 


Val 

vox 


Tvr 
xyx 






^ ^ 




Asp 


Ala 


Asn 


Glv 




470 






Glv 


Phe 


Glv 


Ala 


*> o ^ 








Val 


xy X 


Lys 


Asp 


Asn 


Pro 


Glu 


Val 








520 


Pro 


Val 


Asp 


Ser 






535 




Phe 


Ala 


Phe 


Gly 




550 






Thr 


Glu 


He 


Phe 


565 








Leu 


Glu 


Lys 


Pro 







7 ^ 




vjx y 


OCX 


fix o 


Thr 

1 IIX 




90 






Arg 


xiy s 


OCX 


Val 
V d X 


105 








Ala 


Ala 


Lys 


Asn 


Thr 


Leu 


Asn 


Ala 








140 


Gly 


Met 


Pro 


Asn 






1 

X ^ «7 




Al a 
i^x a 


L xlX 


Val 

V CIX 


Thr 




170 






Asn 


Ser 


Ala 


Ser 


185 








Leu 


Asn 


Val 


Tvr 

xyx 


He 


Glu 


Thr 


Leu 








220 


Gly 


Val 


Val 


Thr 






235 




a a n 


Ala 


nx o 


Thr 

X XXX 




^ ^ u 






iyr 


IrllC 


Glv 
vjxy 


Ara 


265 








nl V 

wxy 


ni V 
oxy 


S^r 

OCX 


Ara 


Lys 


Asp 


Ser 


Thr 








300 


Phe 


Val 


Ser 


Thr 






315 




uc xx 


It liC 


ni V 


Leu 




J >^ \J 






Al a 
nXd 


Val 
V d X 




Lys 


345 








lyr 


T.^i 1 
XlC KX 


ixp 


ni u 

VJX u 


Asp 


Lys 


Glv 
uxy 


Phe 








380 


Glu 


His 


Pro 


Leu 






395 






Ara 


Ala 


Val 




410 






XllX 


JJC u 


Val 


He 


425 








fJl V 

vjxy 


m V 

VJX y 


Tvr 
lyt 


Asp 


ni 11 

VJX u 


Ala 

nxo 


Ala 


Lys 








460 


Phe 


Pro 


Leu 


Pro 






475 






c X 




Tvr 
lyx 




490 






It X 


Thr 

X IIX 


Tl #» 
X xc 


OCX 


505 








Cys 


Lys 


Glu 


Pro 


Ala 


Gin 


Gly 


Val 








540 


Val 


Gly 


Ser 


Gin 






555 




Phe 


Arg 


Met 


Ala 



570 



Leu 


Glu 


Val 


Ser 






95 




Ara 


Trp 


Glu 


Ala 




110 






Val 


He 


Leu 


Phe 


125- 








Ala 


Arq 


He 


He 


Gly 


Asn 


Leu 


Glu 








160 


Thr 


Gly 


Ser 


Phe 






175 




Ser 


He 


Met 


Thr 




190 






Pro 


Ser 


Asn 


Leu 


205 








Ala 


Glu 


Met 


Leu 


Thr 


Thr 


Phe 


Gly 








240 


Ara 


Arq 


Arq 


Gly 






255 




Gly 


Gly 


Phe 


Gly 




270 






Asp 


Phe 


He 


Pro 


285 








Asp 


Trp 


He 


Ala 


Arg 


Ser 


Glu 


Leu 








320 


Phe 


Asn 


He 


Asp 






335 




Arq 


Pro 


Glu 


Met 




350 






Met 


Thr 


Gin 


Lys 


365 








Phe 


Leu 


Met 


Val 


Asp 


Trp 


Pro 


Arg 








400 


Ala 


Trp 


Ala 


Lys 






415 




Val 


Thr 


Ala 


Asp 




430 






Tyr 


Ser 


Lys 


Gin 


445 








Phe 


Pro 


Thr 


Tyr 


Asp 


Thr 


Thr 


Arg 








480 


Cys 


Glu 


Thr 


Tyr 






495 




Asp 


Gly 


Lys 


Gly 




510 






Gly 


Leu 


Pro 


Thr 


525 








His 


Thr 


Ala 


Asp 


Phe 


Phe 


Asn 


Gly 








560 


Gin 


Ala 


Leu 


Gly 






575 





(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 301 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 



Met 


Tvr 


Lys 


TrD 


He 


He 


Glu 


Gly 


Lvs 


Leu 


Ala 


Gin 


Ala 


Pro 


Phe Pro 


1 


5 










10 










15 


Ser 


Leu 


Glv 


Glu 


Leu 


Ala 


ASD 


Leu 


Lys 


Arg 


Leu 


Phe 


Asp 


Ala 


He He 






20 










25 










30 




V 




Thr 


Met 


Pro 


His 


Glu 


Gin 


Pro 


Leu 


Asn 


Glu 


Lys 


Tyr 


He Glu 






35 










40 










45 






Tie 

JL J. C 


Leu 


Glu 


Ser 


His 


Glv 


Phe 


Gin 


Val 


Leu 


His 


Val 


Pro 


Thr 


Leu Asp 




50 








55 










60 








Phe 


His 


Pro 


Leu 


Glu 


Leu 


Phe 


Asp 
****** 


Leu 


Leu 


Lys 


Thr 


Ser 


He 


Phe He 


65 










70 










75 








80 


Asp 


Glu 


Asn 


Leu 


Glu 


ArQ 


Ser 


His 


Arq 


Val 


Leu 


Val 


His 


Cys 


Met Gly 








85 










90 










95 


Glv 


lie 


Glv 


Ara 


Ser 


Glv 


Leu 


Val 


Thr 


Ala 


Ala 


Tyr 


Leu 


He 


Phe Lys 




100 










105 










110 




Glv 


Tvr 


Asp 


He 


Tvr 


Asp 


Ala 


Val 


Lys 


His 


Val 


Arg 


Thr 


Val 


Val Pro 


115 










120 










125 






Glv 


Ala 


He 


Glu 


Asn 


Arg 


Gly 


Gin 


Ala 


Leu 


Met 


Leu 


Glu 


Asn 


Tyr Tyr 


130 










135 










140 








Thr 


Leu 


Val 


Lys 


Ser 


Phe 


Asn 


Arg 


Glu 


Leu 


Leu 


Arg 


Asp 


Tyr 


Gly Lys 


145 








150 










155 








160 


Lys 


He 


Phe 


Thr 


Leu 


Gly 


Asp 


Pro 


Lys 


Ala 


Val 


Leu 


His 


Ala 


Ser Lys 








165 










170 










175 


Thr 


Thr 


Gin 


Phe 


Thr 


He 


Glu 


Leu 


Leu 


Ser 


Asn 


Leu 


His 


val 


Asn Glu 








180 










185 










190 




Ala 


Phe 


Ser 


He 


Ser 


Ala 


Met 


Ala 


Gin 


Ser 


Leu 


Leu 


His 


Phe 


His Asp 






195 










200 










205 






Val 


Lys 


Val 


Arg 


Ser 


Lys 


Leu 


Lys 


Glu 


Val 


Phe 


Glu 


Asn 


Met 


Glu Phe 




210 










215 










220 








Ser 


Ser 


Ala 


Ser 


Glu 


Glu 


Val 


Leu 


Ser 


Phe 


He 


His 


Leu 


Leu 


Asp Phe 


225 










230 










235 








240 


Tyr 


Gin 


Asp 


Gly 


Arg 


val 


Val 


Leu 


Thr 


He 


Tyr Asp 


Tyr 


Leu 


Pro Asp 




245 










250 










255 


Arg 


Val 


Asp 


Leu 


He 


Leu 


Leu 


Cys 


Lys 


Trp Gly Cys 


Asp 


Lys 


He Val 




260 










265 










270 




Glu 


Val 


Ser 


Ser 


Ser 


Ala 


Lys 


Lys 


Thr 


Val 


Glu 


Lys 


Leu 


Val 


Gly Arg 






275 








280 










285 






Lys 


Val 


Ser 


Leu 


Ser 


Trp 


Ala 


Asn 


Tyr 


Leu Asp 


Tyr 


Val 






290 










295 










300 









(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 257 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 



Met 


Ara 


He 


Leu 


Leu 
5 


Thr 


Asn 


Asp 


Arg 


Ala 


Ala 


Val 


Lys 


Gly 


Leu 


Ser 








20 










Ala 


Pro 


Leu 


Phe 


Gin 


Ara 


Ser 


Ala 






35 








40 




Pro 


He 


Aro 


Ala 


Lvs 


Ara 


Val 




50 










55 




Tvr 

A jr J. 


Gly 


He 


Asp 


Gly 


Thr 


Pro 


Thr 


65 










70 






Arcf 


Phe 


Gly 


Asp 


Phe 
85 


Asp 


Leu 


Ala 


Asn 


Leu 


Ser 


Thr 


Glu 


He 


Thr 


Val 








100 










Glu 


Ala 


Ser 


Thr 


His 


Gly 


He 


Pro 






115 










120 


Glu 




Lys 


Lys 


Thr 


Leu 


Gly 


Glu 




130 










135 




Ser 


Ala 


His 


Phe 


Leu 


Ara 


Arg 


He 


145 










150 






Leu 


Pro 


Glu 


Gly 


Val 


Asp 


Met 


Leu 










165 








Ser 


Glu 


Gly 


Thr 


Glu 


He 


Ala 


He 








180 










Ser 


Pro 


Thr 


He 


Glu 


Glu 


Arg 


He 






195 










200 


Trp 


lie 


Val 


Gly 


Arg 


Leu 


Val 


Gin 


210 










215 




Tyr 


Ala 


Leu 


Lys 


Val 


Glu 


Arg 


Lys 


225 










230 






Asp 


Met 


Thr 


Ala 


Arg 


Val 


Asp 


Phe 



245 



Leu 



ASD 


Glv 


He 


Tyr 


Ser Asn Gly 


Leu 




10 










15 




Glu 


Leu 


Gly Glu 


Val 


Tyr 


Val 


Val 


25 










30 






Ser 


Gly 


Arg Ala 


Met 


Thr 


Leu 


His 










45 








Asp 


Val 


Pro Gly 


Ala 


Lys 


He 


Ala 








60 










Asp 


Cys 


Val 


He 


Phe 


Ala 


He 


Ala 






75 










80 


Val 


Ser 


Gly 


He 


Asn Leu Gly 


Glu 




90 










95 




Ser 


Gly 

srf A jr 


Thr 


Ala 


Ser 


Ala 


Ala 


He 


105 








110 






Ser 


Val 


Ala 


He 


Ser 


Leu 


Glu 


Val 










125 








Gly 


Glu 


Gly 


He 


Asp 


Phe 


Ser 


Val 








140 










Ala 


Thr 


Ala 


Val 


Leu 


Lys 


Lys 


Gly 






155 










160 


Asn 


Val 


Asn 


Val 


Pro 


Ser 


Asp 


Ala 




170 










175 




Thr 


Ara 


Leu 


Ala 


Arg 


Lys 


Arg 


Tyr 


185 










190 






Asp 


Pro 


Lys Gly 


Asn 


Pro Tyr 


Tyr 










205 








Glu 


Phe 


Glu 


Pro 


Gly Thr Asp 


Ala 








220 










Val 


Ser 


Val 


Thr 


Pro 


He 


Asn 


He 






235 










240 


Glu 


Asn 


Leu 


Gin 


Arg 


Leu 


Leu 


Ser 




250 










255 





(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTElRISTICS : 

(A) LENGTH: 264 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 





(xi) SEQUENCE 


DESCRIPTION: 


: SEQ ID 


NO: 36: 






Met 


Glu 


Asn 


Leu 


Lys 


Lys 


Tyr 


Leu 


Glu 


Val 


Ala 


Lys 


He Ala Ala 


Leu 


1 








5 








10 






15 




Ala 


Gly 


Gly 


Gin 


Val 


Leu 


Lys 


Glu 


Asn 


Phe 


Gly 


Lys 


Val Lys Lys 


Glu 




20 










25 








30 




Asn 


He 


Glu 


Glu 


Lys 


Gly 


Glu 


Lys 


Asp 


Phe 


Val 


Ser 


Tyr Val Asp 


Lys 






35 








40 










45 




Thr 


Ser 


Glu 


Glu 


Arg 


He 


Lys 


Glu 


val 


He 


Leu 


Lys 


Phe Phe Pro 


Asp 




50 








55 










60 






His 


Glu 


Val 


Val 


Gly 


Glu 


Glu 


Met 


Gly 


Ala 


Glu 


Gly 


Ser Gly Ser 


Glu 


65 








70 










75 






80 


Tyr 


Arg 


Trp 


Phe 


He 


Asp 


Pro 


Leu 


Asp 


Gly 


Thr 


Lys 


Asn Tyr He 


Asn 




85 










90 






95 




Gly 


Phe 


Pro 


He 


Phe 


Ala 


Val 


Ser 


Val 


Gly 


Leu 


Val 


Lys Gly Glu 


Glu 






100 










105 








110 




Pro 


He 


Val 


Gly 


Ala 


Val 


Tyr 


Leu 


Pro 


Tyr 


Phe 


Asp 


Lys Leu Tyr 


Trp 



115 120 125 



Gly 


Ala 


Lys 


Gly 


Leu 


Gly 


Ala 


Tyr 


Val 


Asn 


Gly Lys 


Arg He 


Lys 


vai 




130 










IOC 

135 








140 








Lys 


Asp 


Asn 


Glu 


Ser 


Leu 


Lys 


His 


Ala 


Gly 


Val Val 


Tyr Gly Phe 


Pro 


145 










150 










155 






X O w 


Sex 


Arg 


Ser 


Arg 


Arg 


Asp 


He 


Ser 


He 


Tyr 


Leu Asn 


He Phe 


Lys 


Asp 










lob 










1 / U 






175 




Val 


Pne 


Tyr 


Glu 


val 


Gly 


Ser 


Met 


Arg 


TV v<>^ 

Arg 


Pro Gly 


Ala Ala 


Ala 


V ai 








180 










lo5 






190 






Asp 


Leu 


Cys 


Mec 


vai 


Ala 


LtIU 


vjiy 


lie 




Asp tjiy 


Met Met 


Glu 


ir XiC 






195 










200 








205 






Glu 


Met 


Lys 


Pro 


Trp 


Asp 


He 


Thr 


Ala 


Gly 


Leu Val 


He Leu 


Lys 


Glu 




210 










215 








220 








Ala 


Gly 


Gly 


Val 


Tyr 


Thr 


Leu 


Val 


Gly 


Glu 


Pro Phe 


Gly Val 


Ser 


Asp 


225 










230 










235 






240 


He 


He 


Ala 


Gly 


Asn 


Lys 


Ala 


Leu 


His 


Asp 


Phe He 


Leu Gin 


Val 


Ala 








245 










250 






255 




Lys 


Lys 


Tyr 


Met 


Glu 


Val 


Ala 


Val 















260 



(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4299 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 

TGCCCGAGCG TGTTGCCAAG ATGCTTGAAA GAATGCTATC CAAGGCGGAA TCTATGCTCG 60 

GCGACGCCCA GAGGCTTATC GAGGAGGGTA AGGCCGTTGA GGCTAAGAAG CTGTTAGCGG 120 

CTGCTCATAG GCTAGTAGAT CGCCTAGAGG ATGCTCTCGA CCACGCCCTC AACCATATAG 180 

AGCATCACAA GGAACATCAT GAGGAGCACC ACAAGGAGCA CGACTAACAA CACTCTTAGA 24 0 

ATCTCGAGAC GAGCTTGCTT CCCGTGTCTC TCGCGCCTAG CCAGTTTTTA ATAGCCTAAG 300 

CCGAGACCCA CATTCCAACA TTACTCCGTT TGTCACTATC ATGTTCTAAT TGTCACACGC 360 

CCCGTATAAA TTGGGGGACC TGGAGGAAGC GTTGCCGGTG ACCCCGCGTG GCCAAGAAGG 420 

CTGTCTGCCC AATATGCGGT GGCGATGTTG AACTACCCGA TAACGTAATG GATGGCGAGA 480 

TCGTGGAGCA CGACTGTGGG GCAATGCTAG TCGTGAGGAT CCGGGATGGC AATGTTGTTC 540 

TAGAGCAGTT GGAGCGCGTT GAGGAGGACT GGGGAGAGTA GAGGCTATGC GCATAGCAAT 600 

CGTTTATGAC CATCCGCGTG TTGAGGAGAA GAGGTTAGCT GAGGAAGCGA GGAAGCTTGG 660 

TCACGAACCT GTCCTCTTTA ATATTGACTC GTTGCTCTTT CGCCTTGATA GCCTGGAGCG 720 

CATTCTAGGC GATGTTGATG TAGTACTTCA GAGGGCGGTG AGTTACTTCA AGGCTCTCGA 780 

GTCTACAAGG ATACTCGAGG CTGCCGGCTA CACTGTCATC AACAATAGTT TAGTGCAGCT 840 

TAACTGCGGC GACAAACTAT TGACAACGAT CTTGCTTGCT AAGCATGGTG TGCCAACACC 900 

GCGTGCATAC GCTGCTTTTT CGCGTGACAC TGCTGTGCGG GCTGCAGAGG AGCTTGGATA 960 

CCCCGTTGTT GTCAAGCCCG TCATTGGTAG TTGGGGTAGG CTTGTGGCTA GGGCTGATTC 1020 

CAGGGAGAGT CTAGAGGCTG TGATAGAGCA TAGAGAGGTT CTCGGCCCGG CTTACTACAA 1080 

GGTTCATTAT GTGCAAGAGT ATGTGCGCAA GCCTCTACGT GACATACGCG TATTCGTGAT 1140 

TGGTGATGAG GTTCCCGTGG CGATATACAG GGTTAACGAG CGTCATTGGA AGACTAACAC 12 00 

GGCACTAGGC GCCAAGGCCG AGCCTGCGCC AGTGACCCCC GAGTTACGTG AGTTAGCGCT 1260 

TCGCGCGGCC AAGGCTGTGG GTGGCGGTGT GCTTGGTATA GATGTGTTTG AAGACCCGGA 132 0 

GAGAGGCCTC CTCGTGAACG AGATTAACGC GAACTCGGAC TTCAAGAACA CTGAGAGGGT 1380 

GACCGGGTTT AACATGGCTA GGGCTATCGT CGAGTATGCA GTGTCGGTCG CGAAGAGGTG 1440 

AATGGAATGG ATAGGGTAGA GGTGCTTCTG GATGAGGCTA GGCGTGGCGC TATAGAGGGT 1500 

GACGCTCGCC GCGCATGTGA AGCGGCATTA AGGCTGGTTG ACGTTGTGCT CCGCGAGGGG 1560 

CCTAGGGTTG CACAGGAGTC TGGGCGTGGG ATTGAACCCG GTGATGTACT ACTAGCTGAG 1620 

GCTCTGAGCT TGAGAGCAGA GCAGGTGAAG GAGGAGCCCA AGGCGGACAA TTGTCTGGAG 1680 

CTCGCAAAGG CTGCATTCCG CCTCTATAAG CGGCTCCAGG GGATGGAGTA AAGTTCGCAG 174 0 

TGTGTTGCCC GTTTTAGCCT CTGCCTTACT TTCTACTCGC GTGAGGCGAG TGTCCCTTGA 1800 

CACGTTGCTG GCGCGAGCTG AGAAACGACC TCGAGATGAT ACCCGAGATC GTCGAGAAGC 1860 

AGATCGAGGA GACGATAGTG CCGGAGGGTC TTGGCGAGCA ACGACTTGTG TTCATTGGCA 1920 

GCGGTGATTC TTTCGCGGCC GCACTTGTAG CCGAGCATGC CGGCATAGGC GTCGCACGCG 1980 

ATCCTCTTGA TGTGCTAGTG GCTGGCGTTG ATGGGCCTGG CGACGCTATA CTCCTAAGCG 204 0 



TTGGTGGGCG CTCAAAACGA GTTGTTGACG CGGCTCGTTT CCTGTCTTCA CGTGGCTTTC 2100 

GTATCATAGC GGTCACGGGT AACGAGAGGA GTCCTCTCGC ACGCACAGCA CACGTTACCG 2160 

TGAAGCTCGT CTATTCTGAC CTCGCCTGTG GCATGGGCGC CGCACGCCAT GTCGCTATGC 2220 

TTGCAGCGCT CTCCGCATTG TTCAACGCTA GACCTCGTAT ACCCGAGAAG CTTGTTGAGG 22 80 

AGCCCCTGCC TTTCGACCCT CAGGCTGTGT ACGCGGGTGT GGGCGTTGGT GTAGCCTCTG 2340 

CCCTGTTCAT GGTGTTGAAG ATCTGCGAGT TGCTCGCAGA CTGCGCCACC TGGTGGCATC 2400 

TAGAGCAGTT CGCACACGCA CCTGTCTATG GCACGAGAAG CAATATACTC GTCGTGTATC 24 60 

CGATCCTCGT TGTGAGAGGA GCACGCTAGA GGAGTATCTC TCGGCCTTCC GGGAGGCCGG 2520 

GTTTGAGGTC ACCACTGTAC CCGTGTTGAA CGACCCTTGG TCTACAGCTA TTCTCCACGC 2580 

TACGCTGGCC ATCTCCAGTG CTGCAGAGAC CGCCTTCAGT CGCGGCATTG AGGAGCCGGG 2640 

ATATCGTGCA CATCCCGCGC TTAGCAGGCT AACCAGGCTG ATCTACCTAG AGGAGTAGAA 2700 

CCTCTCGAGG ACCGGTATGT AGTGGTCTAG AGGCTTCCCG TCATGGTGTA TCGCGAGGCC 2760 

TATTCCTGCT CTCCTCGCGC CTTCCACGTT GGGCTCATAA TCATCTATGA ATGCTGTTTT 2 82 0 

CGCTGGGTCC GCGCGAAGGA GTTGCATCGC CGCCTCGTAT ATCTTTGTGT GTGGCTTGCA 2 8 80 

AAAGCCGACA ATATCCCTCG TAACCACCGT ATCCACGAGG TGGGCTAGAT CGTCACGCTC 2 94 0 

TAGAAGTAGA CGTACGCATT CGTAGCACCA GTTGTTCGAG ACTATGCCGA CCAGTATCCC 3000 

GTTTCTCTTG GCCCATCTTA GCAGCTCGTA TGTACCCGGT GCTACGTATA CGCCAGACAG 3060 

CACAGCTGAT TGCAATACCC TTGCTAATGC CTCTGCCCTT GAGGGGGTCG GCGTCAAGCC 3120 

GTGTTTTGCG AGGAGCACGG CAGCCGCATA CACTATACTT TGTTGCACGG AGACATCCAG 3180 

CCTCCACGTG TCCATTACAC GCCTCACGCT ATCCGGCGTC GCGTCGGCCC CTAGGGCACG 3240 

TAGATGTCTG GCAGCAGTCT CGTAGAGAGT CTCCTCGTAC CACTCATTTG TGAGGTAAAT 3300 

GACGCCACCT AAATCCAGCA GGAGTGTAGG GTTACGCGGC AAGGCGCCTC CTCATGTATT 3360 

CGAGGAGGCC GCCCGTTGCC AGAATTTCAG CTACAACACC CCGGAAGGGC GGGAAACGGT 3420 

ACGTCAACAC CCTACCATCC TTCTTGATGA GCTTCGCTAC ACCCTCGTCA AGGTTTATCT 34 80 

CTATCTCGTC GCCCTCCTCG GCCGCCTCCA CGAGCTCTGG GAGCACTATA ACGGGGAGCC 3540 

CGTTGTTAAT CGCGTTACGG TAGAATATTC TCGAGAAGCT CTTCGCTATG ATGGCCTTGA 3600 

CGCCTGCAGC CTTGAGAGCT ATCGCGGCTT GCTCCCTGCT ACTACCCATA CCAAAGTTCC 3660 

TACCCGCGAC CAGCACTACA CCCTTGGACG CCTTCTTGGG GAACTCCGGA TCCAGAGGCT 3720 

CCATAGCATG CTCGGCAAGC TTCTCCGGCT CAGTATATAC CAGGTAGCGG GCAGGGATAA 3780 

TCACGTCGGT GTTGATGTTA TTGCCGTAAT TGAGCACAGG GCCCTTCACG ACACCCAGGT 3840 

TCAAGAGAGG TTCACCACAA GTTTGGCCTC GCTATCCCAG GCTATAATCC AGCTGTTTAC 3 900 

TCGGCCAGCT TCACCCACAC ACTTTTCAAC TCCATTATCC TTGTAGCGCA ATCTACCCTT 3960 

CTGGGTAGCA CAGCGTTAAG CCCATAGTGC CAAGGCGCCA CAATGATGCC CTCCGGCACA 4020 

TTCTCGTCGG GTATCAGCCG GAGGCGTATG GCCCCTCTCT CCGTCTCGAG CCTAGCGTGA 4080 

CCGGCGCCAG CCTCCTTAGG GTTGACTCGT GCGTATAGCT CGCCGCTCAC ATCTAGCATC 4140 

GCGTTTGTAC AGTAGCTCAC CGGGTCTCTT GCAGTCACGA GCACCTTCCT ATCACCATCG 4200 

GGCACGACCG GCTCGACCGG CGGGTATAGA CGGACGCGTA TCCTCGAGAC ACGCCTGGGC 4260 

AGGAGGTACT CGCCTCTCTC CGCAACCGCC TTGGAGGAA 4299 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5520 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

TGGACTGATA AAGAAAAAGA AGAGGTTTAA GGGCCTCAAT ATTAAATTCT ACACATTAGA 60 

TATCCAAAAT GGAGAATTAC TTAATCTAGA GACTTACCTT AAGGAGTTAC ATGAGTTCCT 120 

TAGAGGCCTT ACATTAAAAC GAAAAGTAGA AGAGGAACAA TGACCCCCGA AGAGCTCCTA 180 

ACCCGCCTCG AATTCAAAGG AGTAACCCTC GAAAAGATGC TCAATACTGC GTTAGAGCTC 240 

TACATCGGCG ACGAGCGCGA GAAAGTTCGA GAAAGGCTGA GAGAGCTGAT GCTGAGGTAT 300 

CTGGGCGACA TCAACGTTCA AGCTCTGCTC TTTTCGGCTC TACTGCTCGA AGAGAACTTC 360 

AAGGTTGAGG GCGACCCCGT GAACCTTGTG GCCGACGAGC TCATCGGCAT GAACATCGCC 42 0 

GAGCTCATAG GTGGAAAGAT GGCGCTCTTC AACTTCTTCT ACTACGACAC CAAGAAGCCC 4 80 

GGCATTTTAG CCGAGCTTCC GCCTTTCCTC GACGATGCGA TAGGGGGCTT TATAGCGGGC 540 

TGTATGACAA GGCTGTTCGA GGGGGTGTAC GGTGCGGAAT CTCTTACCCT TCTTCACGCG 600 

GATTCCGGTC AAAGGCAACT TCAAAAGGGT TAGAAATGAG CTCTGGGCAC TTCCCATTCT 660 

CGCACCGGTA ACTTCGGCCC TGGCGACGCT CGTGGGCTCT GTGCTCGCCG GGGTAATAAT 720 

CCTGGGCGGC AACTACGCGT TTCACCCAAC GTCTCGGCAA CCCACGTGCT GATAACCCTC 780 

ATAGGCTTCG TCGTGGTCTA CAGCATACTG TTCTACATCT GGCTCCACTT CGTCAGGAAG 840 



CTCATCAGGG AGGGCCCCGA ACCGGTTGAG GGTGACGTCA CCGCGAAGCC GACCCCTGCC 900 

GTTAGCGCCG CGGGAGGTGG TCAGTGATGG ACTACGCGAC CGCATGGTTT TACTTCTCCG 960 

CCTTCCTCCT CGGAATGTAC TTAGCGTTTG ATGGCTTCGA CCTTGGCATA GGCGCGTTGC 1020 

TCGCCCTGAT TAAGGACCAG AGGGAGCGCG ACATACTCGT GAACACCATC GCGCCGGTCT 1080 

GGGACGGCAA CGAGGTCTGG TTCATCACCT GGGGTGCCGG GCTCTTCGCG ATGTGGCCGG 114 0 

CGCTCTACGC GACGCTCTTC AGCACGTTCT ACCTTGCCGT CTGGCTGCTC GCGTTCCTGT 1200 

TCATATTCAG GGCTGTCGGC TTTGAGTTCA GGAACAAGAA CAAGGAGCTA TGGGACAAGC 1260 

TCTTCGCTCT CGTCAGCGCG TTAATCCCGC TCGTCATCGG CGTCATAGTC GGCAACCTCA 1320 

TCATGGGAAT TCCCATTGAC GCCAAGGGCT TCCACGGCTC ACTGCTGACG CTCTTCAGGC 13 80 

CCTACCCGCT CATCGTCGGC CTCTTCATAC TCTTCGCGGT GACCTGGCAC GGAGCCAACT 1440 

GGGGCGTCTA CAAAACCACA GGAAAGCTCC AGGAGCAGAT GAGGGAGCTC GCCTTCAAGG 1500 

CCTGGCTCCT GACCGTCGTC TTCCTCCTGC TCACAGTCAT CGGCATGAAA ATCTGGGCCC 1560 

CACTGAGGTT CGAGAGGGCA CTAACGCCGC TTGGGCTCCT CCTAACGGTT GTCATCCTCG 1620 

TGGCAGGACT GCTCGACGGA CAGCTCATCA AGAAAGGGGA GGAGAATTTG GCCTTCTACA 1680 

TCAGCTGGCT GGCCTTCCCG CTCGTTGTGT TCCTCGTCTA CTACACAATG TACCCCTACT 174 0 

GGGTCATCTC GACCACCGAT CCGAACTTCA AGCTCAGCAT ACACGACCTC GCGGCATCTC 1800 

CGCTGACCCT CAAGGCCGTC TTGGGAATCT CGCTGATCCT GGCGGTCATC ATCATGGCCT 1860 

ACACCCTCTA CGTATACAGG GCCTTCGGCG GAAAGGTCAC CGAGGCGGAG GGCTACTACT 1920 

GAGTTCCCCT TTCCTTTTTC GATATTCGAA CTTTTTTAGG GAAAAGTTTA TAATTCGAGT 1980 

CACCTAAGTT CCTTCTGGAA ACCTAAAAAA CGGTGGTCGA AATGCACAGA GGCAGATCTA 2040 

CCGGCTGGCC CTACGACCGG AAGCCGGTCC TCGTCTTCTG GGAAACCACC AAAGCCTGCC 2100 

GGCTCAAGTG CAAGCACTGC AGAGCGGAGG CAATACTCCA GGCACTGCCG GGCGAGCTGA 2160 

ACACGGAGGA GGGAAAGGCC CTCATCGATT CCCTCACCGA CTTCGGAAGG CCCTACCCGA 2220 

TACTCATTCT CACCGGTGGC GACCCGCTCA TGAGGAAGGA CATCTTCGAG CTCATCGAGT 2280 

ACGCCGTTGA GAAGGGCATT CGCGTTGGTC TCGCCCCCGC TGTAACGCCC CTCCTGACCG 2340 

AGGAAACAAT CGAGAGAATC GCGAGGAGCG GAGTTAAGGC GGTAAGCATA AGCCTCGACA 2400 

GCCCGTTTCC AGAAGTTCAC GACGCAATCA GAGGCATAGA AGGGACGTGG GAGAAAACCG 2460 

TCTGGGCCAT CAAGGAGTTC CTGAAACACG GCCTAAGCGT TCAGGTGAAC ACGGTTGTGA 2520 

TGCGCGAGAC CGTTGAAGGA CTGCCCGAGA TGGTGAAACT GCTTAAAGAC CTCGGCGTCG 2580 

AAATCTGGGA GGTCTTCTAC CTCGTCCCGA CCGGGAGGGG CAACTTCGAG AGCGACCTGA 2640 

GGCCGGAGGA GTGGGAGGAC GTCACACACT TCCTCTACGA GGCCTCGAAG CACCTCCTCG 2700 

TGAGGACCAC CGAGGGCCCG ATGTTCAGGC GAGTGGCGAT AATGAGGAAA GCCCTTGAGG 2760 

AGAAGGGATT CGACCCCGAC GAGGTTCTCA AGCCCGGGGA GCTCTACTTC CGGCTGAAGA 2820 

AACGGCTCGT TGAGCTTCTC GGCGAGGGGA ACGAGGCGAG GGCCCAAACT ATGGGAACGC 2 880 

GCGACGGGAA GGGAATAGTC TTCATCGCCT ACAACGGCAA CGTCTACCCG AGCGGTTTCC 2940 

TGCCCTTCAG CGTCGGCAAC GTCCGCGAGA AAAGTTTGGT TGAGATTTAC AGGGAGAGTG 3000 

AACTTATGAA AAAGCTCCGC TCGGCCGAGT TCGAGGGGCG CTGCGGGAGG TGCGAGTTCA 3060 

GGGAAATCTG CGGGGGAAGC AGGGCGAGGG CCTACGCCTA TCGCTTAAAC CCGCTCGCCG 3120 

AAGACCCTGC CTGCCCGTAC GAGCCGGGCT CATACCTAAG GCTCGCCAAA AAGTTCAATC 3180 

TTCACCTTCC GATTGAGATT TTTGGAGCCC AAAAGCCGAT TTGAGGTGAT GGAAATGAGG 3240 

TGGAAGGCTG TTTTACTGAT TGGAATCCTC CTCGTGTCTG TCCTCGGTGC CGGATGCGTT 3300 

GGCTCGAATA CCTCAACTGA AACCGGCCCA TCCCAGAAGG AAATAACCGT GAAGGACTTC 3360 

TCGGGAAGGA ACATCACGGC TAAAGTTCCG GTTCAGCGGG CGGTCGTTCT CTCGACTTCC 3420 

GCCCTCGAAA TAATCCAGCT CCTCAACGCG AGCGACCAGG TCGTCGGTAT TCCAAAGGAG 3480 

GCCCAGTACG ACGCTTTACT GAGCGAAAGC CTGAAGAACA AGACCGTCGT TGGCGCGAGG 3540 

CTCAAGATTG ACGACTGGGA GAAGGTTTTA GCCCTAAAGC CCGACCTAAT CATCGACCTC 3600 

GACCTGAAGA AGTTCTACAA CGTTGACGAG CTCCTCAACC GCTCCGCCAG CTACGGAATT 3660 

CCGGTCGTCC TGCTGAGGGA GGATAACCTT GAGGACATAC CGAAGGCGGT TTCGCTCCTC 3720 

GGTCAGCTCT TCGGAAGGGA GAAAGAGGCC AAGGCCTTCG ACGACTACTT CAACGAGCAG 3780 

GTGAAGGAGG TTAAGGCCAT AGCCTCAAAG ATTCCAGCGG AGGAGAGAAA GAAGGCGATA 3 84 0 

ATGATACAGC CGATAATGGG CAAGCTCTAC CTCGTCAACG GCAACGACGT CCTTGCTCAG 3900 

GCCGTCAGGC TCGTTGGGGC GGACTACCTC GTGAACCTGA CCTTCAACGG CTACACTCCG 3960 

GTTAGGGTCC CGATGGACGG GGAGAAGATA ATAGCGAACT ACCGCGATGC AGACGTCGTA 4020 

ATCCTCCTGA CGAGCGCCGT AACGCCTTAC GACCAGGTCG AGAAGCTCCG GGAGGAGATG 4080 

CTCAGCGACG AGGCCTGGAG GGGCATTAAG GCCGTCAGGG AGGGCAACGT AGTAATCCTC 4140 

AGGGCGGACA TGGGTAAAGA CTCCTTCCTC CGCTGGAGCC CGCGCTTGGC AGTGGGAATC 4200 

TGGGTCATTG GAAAGGCAAT CTACCCGGAC TACTATCCTG ACTGGAACGA CAAGGCCAAG 4260 

GACTTTCTGA AGAGGTTTTA CGGCCTCTCC TGATTTTTCT TTTGGGGTGG GACGATGATA 4320 

GCGGTCTTTC CAGCGAGTCT CGCGGAAATC GTCAAACTCG TCGGGAAAGC CGGGGAGATA 4380 

GCCGGAGTGA ACGAGGAAAT CAGGTTCGAC CCCTGCCTGC CGGAGCTGAA GGATAAGCCT 4440 

GTCATCGGAA AGTACCTCAA GCGGAGCAAG AGGACCTACT GGGACGTTTT AGAGGAGCTT 4500 

AGGCCGGACC TTATCCTCGA CTTCGATGTT GAGAACCTGC ACTCCGGGGA CGAGCTGAGG 4560 

GCCTTTGGGG AGCGTATAGG GGCAAGGGTC GAGCTGATTG ACTTCGAGAC CGTTGAAGGC 4620 

TTCGTCGAGG CGAGCAGGAG GATAGCCGAG CTAACGAGGG GCGACTTTTC AAAGCTCGGC 4680 

GGGTTCTATG AGAAGCACCT GACGAGGCTG GGTGAGATAA CTGAAGCCAT CGAGGAGAGG 4740 



CCTAAAGCCC TGCTCACCTA CCGGAACTTC AACGTCGTAA CGAGGACCAA CGTTCTGAGC 4 800 

GACGCGGTTA GAAAAGCAGG GGCGATGAAC CTCGGCGAGA GGATACGGAC AAAGCGGAAG 4860 

GTCTATCCGG TAAAGAAGGA GCGCTTCTTC AGGTCCTTCG GCGATGCGGA GCACCTCTTC 4 92 0 

CTGCTCACGA GCATAATGAC GGACAGGGAG AAAATGGAGG GGATAAGGGA TGAAATCCTT 4 980 

GACTCGGCCG AGTGGAGGGC AATGGAAGCC GTTCAGCTCG GAAACGTGCA CATAGTTGGC 5040 

TCGGCCCTCG ACCTTGAGAG CTTCATGCGC TGGAGTCCCC GCATAATCCC GGGAATCTAC 5100 

CAGCTTGGAA GGTTTATACA CGGAACAAAT CACCCACGAA TCTCGTGGAA ATCACTGCAA 5160 

AAGTTTAAAA TCCCCCTCCC ACCCCTCGAA GAACAAAAAC GCATCGTCGC CTACCTCGAC 522 0 

TCGATACACG AGCGCGCCCA AAAGCTGGTA AAGCTCTACG AGGAGCGGGA GAAGGAGCTT 52 80 

GAGAAGCTTT TCCCCGCGGT GCTTGATAGG GCGTTTAGGG GTGAGCTGTG ATTCCGGGAA 534 0 

TGGAATACGG CTTTGAGAGG GCAATCTTTG AGATAGTCAG CGGCTTTGTT CTCTCCCTCG 54 00 

TAGTCAGGGC TTTCGCTTAC AGTTTTGGTC TTCCATGGGT ATCCTTTTTG TTCAACGTTC 5460 

TTTCGATACT TCTGACAATA GGCCTGATTG ACAAAATGCC CTTCTGGTCC ATGTCATATC 552 0 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3896 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

AGCTTGGATA TCGAATTCCT TATATGAAAA ATTCATCGAA TTGGTAAAAA ACCACGATCT 60 

TCATGTGGAA ACTGGAATAT TTGCTGCGCA TATGCTTGTG GAAATACATA ACGATGGTCC 120 

GGTGACTTTG TTACTTGATT CAAGAAAAGG TATTTTGAAG TCATCTTTGC TGTCTCTAGG 180 

AGGACTATAT GCCTGAATAC TCGCATAGCA ATAAAAACAA CTTTTTTGCC GAAAACGATG 24 0 

TGAAGAATTG TCATCTACTG CATGTATGTT GTGCACCCGA TTTGGCAATT TCTTATTTGT 3 00 

CCGGTGCACG TGGTGATATT TTCTTTTACA ATCCTAACAT ACATCCAAAA GCTGAATACG 360 

AGAAACGACA CGCCGAAGTG ATTAAAATTG CTGCACTCTT TAAAATGAAT GTTCTGAAAG 42 0 

TTCCTTATAA TCCTGACCTG TTCTTCAAGC TTACTAAAGG ATTAAAAAAT GAACCTGAAG 480 

GCGGGACAAG GTGCGAGATT TGTATAAGAA TGCGACTAGA AAAAACAATG GAATACGCGA 54 0 

AAGAAAATGG CTACAAGAGT GTTTCCACAA CGCTAACAGC CTCTCCAAAG AAAAATGTAG 600 

CGATGATTGT GAAGATAGGA AAAGAACTGG AAAAAAAATA CGGTGTGGAA TTTTTGCCTA 660 

ATGTGTACCG CAAAAGTCCG CTTTACAACG ATGCGCAAAA GCTTATAACG AAAATGGGTT 720 

ATTTACAGAC AAAACTACTG TGGTTGTATT TTCTCAATAA GAACTTCCGT TATAGTAGCC 780 

ACTCAAGAAA CTAAAACCGT AAAAAGTGGG GTCGAAGTAT GAAAATATAC CACAAATTAG 840 

AAGAAGTTGA AGAACATAAG CGGTCGTATG CATCAATTGC TTTTTCATCG AAAGTCAGGG 900 

TTGAATATGA ACATGCTGGC GAAAAACTTG CCCTCATCCC TGTAACTATT GGAGACCTTA 960 

CGGTGGTTAT CGAAATTGAC GATGATAGAG AAGTATTCAA TACTTTGTTG AACGAGCACA 1020 

TCAAAAACTC TATCCTGAAA CAGTTTCCGT ATCCGGAAGA GATTAGAGGG TTAGCCAGAC 1080 

ATTTTCGCAC AGAATTGAAG AATTTCAGAA TCTTGGTTGT AAAATACAAT AGTGTCGAAG 1140 

AAAAGGAATT CTCAAGGTAT TCACTGTCTA ATATAACATT CGGTGTGGTG TCATACAATA 1200 

AATTTGATGT CCATTTGTTA CCAAGTAATG TAAAAGTCAG ACCGAAGCCA GGATACTGTC 1260 

TTTCACATGT TGTCCAAAAG CCTGAAGAAG GTATCAGGCA AGCATTCTTG TTAGCCCGGT 1320 

GGTTTGGTGG TGGAAGCTAC GACCAACTGC CCAAATTAGC GCTTGAAAGC ACTGACATTG 1380 

ACCTTGGAAA GTGGACAAAT ATAGTCAAAT ACATCGTTCT GTCAGATTTT GAAAAGAGGT 1440 

ATTTTTCTGG TATAATAAAA AAGCTAAACG AATTTAGAAG CGAGACATAT TTTGACCCAT 1500 

TTGCTAGGCT TGAAATGATA TCACTTGGCA TAATACTCGC CAAGTCAGAG GGAGGAGGTA 1560 

ACTTTGAACC AGACAGTTAC GATATCATTT AGAGCACTTA CTGAAAATAT AAAATTAGCA 1620 

CGAGTTGTTA TACATACTTT TCTAACATTC CGAGGAGTGT TCGATAAAGA TATATTCGAT 1680 

ACGGAATTGG CTGTAAACGA AGCGATTGCA AACATTATTC AGCATACATA CAAAGGTGAA 1740 

CCAAACTACG TTGTGATGAC GCTCAATTGG ATAGAACCAG ATACACTCGA AGTGTTACTC 1800 

CGCGATTTTG GTCCAAAAGT GGACCCAACG AAAATCAAAC CACGAGATTT AGATGATATC 1860 

AGACCAGGAG GACTCGGAGT TTATATAATT CAACGCATCT TCGACATTAT GGAATTCCGA 1920 

AACGTGAGTC ATGGAAATTT ACTTTATCTA AAACGCTCCT TCTTAATACC TCCTAAAAAG 1980 

CAGGAGCTTG GGAATTTAAA TAATGAACCC TATCGAGAAT ATTGAAAAAA CCGTCAAAAC 2040 

GGGGGAAAGA AGACAAATGG GCTTGCTCAC AGGTTTGACA AAAAATCCAT CTTTCATGTC 2100 

TGCATTTTTT GGCTTTTTGG CAGCACAATT TTTGAAAGTG GTGATATACA AAGATTTCCG 2160 

CGTATTTGGT AGATACGGTG GTATGCCCAG TGCTCATGTT GCAACAACCT CAGCATTAGC 2220 

TTGGGCTGTT GGTTACACTA CAGGTTTTGA TTCACCGCTT ACAGCCATCG CTGCAATTTT 2280 

CCTTGCTATT ACAACAGCTG ATGCTGTTGG TTTACGAAGA AATGTCGACC CCAATAAAGG 2340 



ACATACACTA ATGGAAGCTA TCTATGGCTT CTTACTTGGG TGGATAGTCG CTCTGCTTAC 24 00 

GGTTAAGTTG TATCGATAAT TTTGAATGAG TTGTAGTGAA ATAGCCCAAG TCTTTTTTCG 24 60 

CAATTACATC ATAATGCCAG GAGGGTAATT TACAATGTTT TTTAGATTAC CATTTAAAGT 2520 

TTTTGTTTTT GCAGTTTTGT TGCTTGCCAT CTCGTTAACA AGTGTTGTTA GTTTTGGACA 2580 

AGATGATGAG CAGATAAAAA CACCAAATTG GTTTAGAAGT GCGGTGATTA AGAAAAGAGC 2640 

TGGTATGAAT CTAAAGACCG CCCCAGAGTT TGTAGATGAC CTATGGAATG CGATATACAC 2700 

TATAGGCACA AAATACAACG TTCCCCCAAC GCTTATAGCC GCTGTCATTT CTGTAGAAAG 2760 

CAACTTCGCC AACGTGAAAG GTGCTGGAGA CGTGGTAGGA ATGATGCAAA TTTCTATCTC 2 82 0 

CACAGCCAAA AATATATCGA AACTCCTCGG CCTCGAACAA CCAAAAAACG GTTGGGATGA 2 880 

GCTCCTCACA AATTATTGGT TGAATATAAC TTACGGTACC GCATACATCG CTTATCTTTA 294 0 

CAAAAAGCAT GGAACTTTAC AGAAAGCGCT CGAAGAATAC AACAACGGAA AAAATAAAAC 3 000 

TAAATACGCC CAGCTGATAC TACAACAATA CAACCTATAC GAGAGCCTCC ATTCTGCTGA 3 060 

AATAAGAAAT AACCAGCAAT TGGATACAGA TAATTCTTCG ACATCTTCTG AAGCAACAGA 312 0 

TACTTTGAAT ACAACCAGTG CAACAAATTC ACAACCAACA TCAGATGCAT CAAATACATC 3180 

AGTTAACACT TCAGAAATCA AGTTCCCGCC TCTTTTCGGA GTTGCAGGTT ATTAAGATAT 324 0 

TTGTTCGGTA GTTACTTAGG AATGTGGGGT GTATAGTTTG GAAGATGAAA AAATGAAACC 3 3 00 

TGAAACGATA GTAAAAATTG AACATTTATC TTTTTCTTAC CCGAGTTTCA GTCTCAAAGA 3 3 60 

TGTAAGTTTT GAGGTTCGGA AGGGAAGTTT CTTCGGCATT ATTGGACCAA ATGGTTCGGG 3420 

AAAAACCACG CTACTCTCAC TCATTATGAA ATTCCAAAAG CCAAAAAGTG GGAAAATAAC 34 80 

AGTTGATGGG AACGATGTGC TCAGGCTATC TCACAAAAAA CTTGCACAAC TTATAGCATA 3540 

CATCGCTCAA GACTTTAACC CTACATACGA TTTCACAGTT GAAGAATTGG TCGAAATGGG 3600 

AGGAATCCCC CGCTCACCAC ATTTTTTCGA AACACCTGTT TACGAGGAAG AATTAGAAAA 3660 

TGCACTCAAA ACTGTTGATT TGCTTGAATA CCGAAAAAGA ATATTCTCCA CTCTTAGTGG 3720 

AGGACAACAG CGCAGGGTCT TGATTGCACG CGCAATCTAT CAAAACACAC CTATCATCAT 3780 

TGCTGATGAA TTGGTTAATC ACTTGGATTT AGGGCAAGCA ATTAAAGTGT TAGATTATCT 3840 

AAAACAACTT ACCGAATGTG GAAAGACGAT AATTGGACAT TCCACCTGCA GCCCGG 3 896 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3212 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(D.) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

ATGTGCTGCA AGGCGATTAA GTTGGTAACG CCAGGTTTTC CCAGTCACGA CGTTGTAAAA 60 

CGACGGCCAG TGAATTGTAA TACGACTCAC TATAGGGCGA ATTGGGTACC GGGCCCCCCC 120 

TCGAGGTCGA CGGTATCGAT AAGCTTGATA TCGAATTCCG TACGAAATGC GGGAAAGAGA 180 

GAAGGAAAAG GAAAGAGAGC ACAGATTTGG AAATGAGACA GAACACGAGG AAGAGCATGG 240 

TATGGCAGAG CGTGAAAGAG CACATGAGAA CGAGTCTGAA GAAATGGGCA AGGGCGTTGG 300 

CATGGGCGCC CATGGAATGA AGATGGGCAA AGAAGCTCGC GAAATGGTGA AGGAAGAATA 360 

CAAGGAAGCA AAGGAGAGAT ACAAGAAGGC TAGAGAAGAG TTTGAAAGAG CAAAGAAGAT 420 

GGGATTGGAC ATCAGAGAGG AGCGCGGATT CAAGATGGCC AAGGGATTCA TGGTAGCTGG 4 80 

ACTAGACGTT GCTGAGATGT GGCTGGAGAG ACTGAAGGTA CAGGTCATGA ATATGGGTGA 540 

AGAGGCCAAG ATCACAGAGG AGACCAAACT GGAGCTGCTC GCAAAGATCG ACGAGAAGCT 600 

TGCAGAAATC AAAGAGCTGA AGAACGAAAT CAATGAGACC TCCTCACCTG AAGAGCTGAT 660 

AGAAACTGTC AAGAAAATCA GAAAGGAGTG GAGAGAAATC AGAGATGAAA TGAGGGCTCT 72 0 

TACTGGCTAT GTCGCCGTTG CCAAGGTGGA AAAGCTTGTT GAAAAGGCCA AGCAGGTAGA 780 

GCTAATGCTT GAGGCAAAGA TCGAGGAGCT CGATGCTGCA GGAGTTGATA CAACCAAACT 840 

CGAGGCAACA CTCGAGGACT TCTCGGCAAA GGTTAATGAA GCAGAAGATT TGATTGACAA 900 

GGCTGAAAAT CTGTTCGAGG AAGGCAACAT TGCTGAAGGA CACATGACTC TCAAGGAAGC 960 

CATAAAGACT CTCAAGGAAG CCTTCAAGGA TGTCAAGGAA GTTGTCAGCG AGATGAAGGA 102 0 

AATGAACCAG TATAGAGTTA GGGAGGGCAA GATCTTCTAC GGAAACGAGA CTGGAGAAGT 1080 

CTGGGTGGAT GGTAATGGTA CTGCTGAGTT TAACGGTACC GGTATCGTTG TGATCAGAGG 1140 

AAACGCAACA CTTGAGGTCG CACCAGAAGA TGCGATCGTG ACACTGGTCG GCTTCGGCGT 12 00 

GAAGAGCGTT GAGGGTGGCG TTTCAAGAGT CAGCGGAGAA GGTAAGGCAG TAATCAGAGG 1260 

AGAAAACCTC ACCGTCAAGG TGGAAGGTGA CGACTTCAAG CTCATAGTGA AGGGCTACGG 132 0 

TACACTCAAA CTCGATGGTG AGGGTGAATA CAGGGTAAAG AAGAGCCCAC AGGAAGAGAT 13 80 

GACATTTAAA CTCTTTCTTC AACTCTAGCA GTTTGAGCAT TGCATTTCCA AGATTTTTGC 1440 

TGTTAGCTTC GGGACAACTT TGAAAATACG TCGAGACAGG CTCAAATGTT GTCCCAGCAT 1500 

TGCAGCTTTC GGCAAAGCGA ACGAGATTTG CGTTCCGCTC CCCAGCCCAA CATGGCTTCT 1560 



GTAATCTGAA AAAACTTCAA GTTCAACAGC TTTCCCAAAA ACATCCAAAA GCTTTTCCGC 162 0 

AACACTTCTA AATCTTTCGA GATTTATTGC ATTTCCTTTC ACCGAAATGC TATCGGATTC 1680 

' TCTTCCCACA ACCTCGATAT GCGGCTCTTC CAGAGCAATA CCCACTCCAC CGTCAATCCT 174 0 

TCCAACCTGG CCGTTCAAAT CAATGAGCGT GATATGAATT CTCGACGGAG TTTTAACCTT 1800 

AACATACATC TATAGAATTT AAACGGTAAT TACTTAAGAA GTTTTGGTTT TGCGAAAAAG 1860 

AGTTCAAAAT TCATTCTTTT AACTGCACTA CAGCTCATCT GTGCCTTTTC TCCTTAATTC 1920 

GATTTTTCTG AGATAGTTCT GGTATCTCGT ATCAACTATG TAAGCCTCGG GAGCTATTAC 1980 

AGGCAGATGA TAACCGGTGA ATATCCTTAT TATCTCTCCA GCCTGAACCG AGCATGTCAG 204 0 

TGCATATGAT ATCGGATCGT GATCGATGTG AGGATACTCC ACCTCGAAGA AAGACACACC 2100 

ATCAGGCAGG AAAGTAGTAA TTATATCGGG AATAAATGGA GCTCCGAGCT CTTCAGCAAC 2160 

TTTTGCAGCC ATTGAAATGT GCTTATGAGC AACAACAACA TCAATACCTT TCAACTGTCT 2220 

CCTGAGTTCT TTATAATCAT GCGGGAAGGG ATAAGAGATT ATACACGAAT CAGAACTCAT 22 80 

AGGATGCACA ACATCATAAT CGTTTGCCTC AAGTGGCTTT ATGCTGGCAT CAAGCCTCAC 2 340 

ATCCATTGGT GTAACTACAT CTCCAATATA CCGAATGCAA CCAACACCAC TTCTCCAGAG 2400 

CAATTCCATG AGCATTCTGC TTCCGATGAC AGCGACACTA AAGTTCCTGA GATAATCTAT 2460 

CTTTTCTTCA TCTGCCATCC CATACCAGGA AATTTTTCTC ATGGCAATAG CCCCGCATCC 2520 

ATTAAATGGT ATTAATTTTT TGCCGTATTT TGAGGAGGTA GATATTAACC AATTATTTTC 2580 

AAACCATTTA AGGGCATCGA TGAAACATCC CAAAACCAGT TCAGCAAAAA ATTAAATCAC 2640 

TGCCACACAT TGAGGACCCC AAAATGGTGT GAGAAATGGA CGAACTGGGA GGAGTTATTT 2700 

TTGATCTGAT AGAAGAGGAG CCCGAAGTTG AGGAGGACGA CGAGATTAAG CTCGCAGAGA 2760 

TATACAGGCT TGCTACAAAA CTTATAAAGT TACTCGAAGA TCTCAAAAGC CATGAGCTTA 2820 

AAGAGTCAGC ATCTCTTATG CTCATAAAGG AAATTATCGG TGAAGACAGA GTTCTGGTTG 2880 

GTTTAGCATC AAAAATGCTC CAGGATATGA GTCTCGGGTT CGAAGAGGAC GAAAAGTACG 2 940 

TTTCTTGATT TTTGAACTGT ATTTTCTACA TGCTCTTTTC CCAACCACAT TCAGTTGCAT 3000 

GCCATACGAA AATTCCAATG CCCAAATCCT GGTAAATGTA CTTTTTCATA GTAAATGCTG 3060 

CCAAACCCAG ATTAAACTCA ATTTCATCAA CAGGAAAAAG AAAGAACGAA AAAAAGACCT 3120 

ACAACAGTCC TATAATTGAC CAAACTTGAT AGATTACAAA CACCACAGTT GGAATCAAAG 3180 

CACAGATGAA AGCTTTCCGG ATTCCTGCAG CC 3212 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 879 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...879 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

ATG GAA ATA ATA AAC AAA TTT CTA AAA AAA ATT GGA TAT AAG AAA GAT 48 
Met Glu lie lie Asn Lys Phe Leu Lys Lys lie Gly Tyr Lys Lys Asp 
1 5 . 10 15 

GGA GAA GAA AAA AAG GAC AAA TCT AAA ACC AAA ATA AAA ATT GAA GAA 96 
Gly Glu Glu Lys Lys Asp Lys Ser Lys Thr Lys lie Lys lie Glu Glu 
20 25 30 

GAA AAA ACC ATG GAT ATC GAA ATT CCA AAA ATT GAA CCT ACT GAA AAT 144 
Glu Lys Thr Met Asp lie Glu lie Pro Lys lie Glu Pro Thr Glu Asn 
35 40 45 

TTT AAT CGT GAT GAA ATT GTT TTT GAG GAA GAT AAT GCC TAC GGT ATA 192 
Phe Asn Arg Asp Glu lie Val Phe Glu Glu Asp Asn Ala Tyr Gly lie 
50 55 60 

TCC CAC AAA GGA AAT AGA ACA AAC AAC GAA GAC AAT ATT TTA ATT AGA 24 0 

Ser His Lys Gly Asn Arg Thr Asn Asn Glu Asp Asn lie Leu lie Arg 
65 70 75 80 



AAA ATA AAA GAT ACC TAG ATA TTA GCA GTT GCA GAT GGT GTC GGA GGG 2 88 

Lys lie Lys Asp Thr Tyr lie Leu Ala Val Ala Asp Gly Val Gly Gly 
85 90 95 

CAC AGO TCA GGA GAT GTT GCA TCA AAG ATG GCA GTG GAT ATT TTA GAA 336 
His Ser Ser Gly Asp Val Ala Ser Lys Met Ala Val Asp lie Leu Glu 
100 105 110 

AAC ATT ATC ATG GAA AAA TAC AAT GAA AAC CTA TCA ATT GAA GAG ATA 3 84 

Asn lie lie Met Glu Lys Tyr Asn Glu Asn Leu Ser lie Glu Glu lie 
115 120 125 

AAA GAA CTT TTA AAA GAT GCA TAC ATT ACG GCA CAC AAC AAA ATA AAA 432 
Lys Glu Leu Leu Lys Asp Ala Tyr lie Thr Ala His Asn Lys lie Lys 
130 135 140 

GAA AAC GCT ATT GGA GAT AAA GAG GGA ATG GGA ACA ACA CTA ACA ACT 4 80 

Glu Asn Ala He Gly Asp Lys Glu Gly Met Gly Thr Thr Leu Thr Thr 
145 150 155 160 

GCA ATA GTT AAA GGG GAT AAA TGC GTT ATA GCA AAC TGC GGG GAT AGT 528 
Ala He Val Lys Gly Asp Lys Cys Val He Ala Asn Cys Gly Asp Ser 
165 170 175 

AGG GCT TAT TTA ATT AGA GAT GGA GAA ATA GTT TTT AGA ACA AAA GAC 576 
Arg Ala Tyr Leu He Arg Asp Gly Glu He Val Phe Arg Thr Lys Asp 
180 185 190 

CAC TCT TTG GTT CAG GTT TTA GTA GAT GAA GGA CAT ATT TCA GAG GAG 624 
His Ser Leu Val Gin Val Leu Val Asp Glu Gly His He Ser Glu Glu 
195 200 205 

GAC GCA AGG CAT CAT CCA ATG AAA AAT ATC ATT ACC TCA GCA TTG GGA 672 
Asp Ala Arg His His Pro Met Lys Asn He He Thr Ser Ala Leu Gly 
210 215 220 

TTG GAT GAA TTT AAG GTA GAT GAT TAC GAA TGG GAT TTA ATT GAT GGT 720 
Leu Asp Glu Phe Lys Val Asp Asp Tyr Glu Trp Asp Leu He Asp Gly 
225 230 235 240 

GAT GTA TTA TTG ATG AGC TCC GAT GGG CTT CAT GAT TAT GTC AGT AAG 768 
Asp Val Leu Leu Met Ser Ser Asp Gly Leu His Asp Tyr Val Ser Lys 
245 250 255 

GAA GAT ATT TTA AAA ACT GTA AAA AAT AAT GAT CAC CCA AAA GAT ATT 816 
Glu Asp He Leu Lys Thr Val Lys Asn Asn Asp His Pro Lys Asp He 
260 265 270 

GTA GAT GAA TTA TTC AAT ACT GCA TTA AAA GAG ACA AGG GAC AAT GTG 864 
Val Asp Glu Leu Phe Asn Thr Ala Leu Lys Glu Thr Arg Asp Asn Val 
275 280 285 

AGT ATT ATT CGT ATA 87 9 

Ser He He Arg He 
290 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 93 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 



Met 


Glu 


lie 


He 


Asn 


Lys 


Phe 


Leu 


Lys 


Lys 


He Gly 


Tvr 


Lys 


Lys 


Asp 


1 








5 










10 








15 




Glv 


Glu 


Glu 


Lvs 


Lys 


Asp 


Lys 


Ser 


Lys 


Thr 


Lys lie 


Lys' 


He 


Glu 


Glu 








20 










25 








30 






Glu 


Lys 


Thr 


Met 


Asp 


He 


Glu 


He 


Pro 


Lys 


He Glu 


Pro 


Thr 


Glu 


Asn 






35 










40 








45 








Phe 


Asn 


Arcs 


Asp 


Glu 


He 


Val 


Phe 


Glu 


Glu 


Asn Asn 


Ala 


Tvr 
i / i 


Glv 


He 




50 










55 








60 










Ser* 


His 


Lys 


Glv 


Asn 


ArQ 


Thr 


Asn 


Asn 


Glu 


Asp Asn 


He 


Leu 


He 


Aro 


65 










70 










75 








80 


Lys 


lie 


Lys 


Asp 


Thr 


Tvr 


He 


Leu 


Ala 


Val 


Ala Asp 


Glv 


Val 


Glv 


Glv 










85 










90 








95 




His 


Ser 


Ser 


Glv 


Asp 


Val 


Ala 


Ser 


Lys 


Met 


Ala Val 


Asp 


He 


Leu 


Glu 








100 










105 








110 






Asn 


lie 


lie 


Met 


Glu 


Lvs 


Tvr 


Asn 


Glu 


Asn 


Leu Ser 


He 


Glu 


Glu 


He 






115 










120 








125 








Lys 


Glu 


Leu 


Leu 


Lys 


Asp 


Ala 


Tvr 


He 


Thr 


Ala His 


Asn 


Lvs 


He 


Lvs 




130 










135 








140 










Glu 


Asn 


Ala 


He 


Glv 


Asp 


Lys 


Glu 


Glv 


Met 


Gly Thr 


Thr 


Leu 


Thr 


Thr 


145 










150 










155 








160 


Ala 


lie 


Val 


Lys 


Glv 


Asp 


Lys 


Cvs 


Val 


He 


Ala Asn 


Cys 


Glv 


Asp 


Ser 










165 










170 








175 






Ala 


Tvr 


Leu 


He 


Ara 

nj.y 


Asp 


Glv 


Glu 


He 


Val Phe 


ArQ 


Thr 


Lys 


Asp 








180 










185 








190 






His 


Ser 


Leu 


Val 


Gin 


Val 


Leu 


Val 


Asp 


Glu 


Gly His 


He 


Ser 


Glu 


Glu 






195 










200 








205 








Asp 


Ala 


Arg 


His 


His 


Pro 


Met 


Lys 


Asn 


He 


He Thr 


Ser 


Ala 


Leu 


Gly 




210 










215 








220 










Leu 


Asp 


Glu 


Phe 


Lys 


Val 


Asp 


Asp 


Tyr 


Glu 


Trp Asp 


Leu 


He 


Asp 


Gly 


225 










230 










235 








240 


Asp 


Val 


Leu 


Leu 


Met 


Ser 


Ser 


Asp 


Gly 


Leu 


His Asp 


Tyr 


Val 


Ser 


Lys 










245 










250 








255 




Glu 


Asp 


He 


Leu 


Lys 


Thr 


Val 


Lys 


Asn 


Asn 


Asp His 


Pro 


Lys 


Asp 


He 








260 










265 








270 






Val 


Asp 


Glu 


Leu 


Phe 


Asn 


Thr 


Ala 


Leu 


Lys 


Glu Thr 


Arg 


Asp 


Asn 


Val 






275 










280 








285 








Ser 


lie 

o on 


•He 


Arg 


He 























290 



(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 951 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...951 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 

ATG ACT CTG CTA GCC CTG TAT CAG AAT AAA CGT GTT ATC GTC AAG CTT 48 
Met Thr Leu Leu Ala Leu Tyr Gin Asn Lys Arg Val He Val Lys Leu 
15 10 15 

GGC TGG GGG AGC GGC ACT AGC CAA ATA ACT AAC GAG GCG CAA GTG CTG 96 
Gly Trp Gly Ser Gly Thr Ser Gin He Thr Asn Glu Ala Gin Val Leu 



20 



25 



30 



AGC GTA TTG CAC GAT ATG CCT ATA GTG CCC AGA CTG CAT ACC CGT CTA 144 
Ser Val Leu His Asp Met Pro lie Val Pro Arg Leu His Thr Arg Leu 
35 40 45 

GAC TTA GAT GAT GTC AAG CTC GTT GCG ATA GAG TAG ATA CCC TAC AAG 192 
Asp Leu Asp Asp Val Lys Leu Val Ala lie Glu Tyr lie Pro Tyr Lys 
50 55 60 

AGC CTT AAC GCC GTC GGC CGC TTG AAC CCC CTT AAG GCT GTC ACA GCC 24 0 

Ser Leu Asn Ala Val Gly Arg Leu Asn Pro Leu Lys Ala Val Thr Ala 
65 70 75 80 

GTC TTC TAT ACA CTC GCA TCG CTA GTC CAT ATC CAC GGC CGT GGT TTT 2 88 

Val Phe Tyr Thr Leu Ala Ser Leu Val His lie His Gly Arg Gly Phe 
85 90 95 

GCT CAT TGC GAC CTA AAG CCG GGT AAC GTT ATA CCA GTT CCC AAG CGT 336 
Ala His Cys Asp Leu Lys Pro Gly Asn Val lie Pro Val Pro Lys Arg 
100 105 110 

GGC ATG GTG TTC ATC GAC TTT GGT GTT GCA CGA CCT TTT GAC GCT GCG 3 84 

Gly Met Val Phe lie Asp Phe Gly Val Ala Arg Pro Phe Asp Ala Ala 
115 120 125 

GGC TTC GCG GCA GGA ACA CCA GGG TAT ACG TGC CCA GAG GCT CTC GGC 432 
Gly Phe Ala Ala Gly Thr Pro Gly Tyr Thr Cys Pro Glu Ala Leu Gly 
130 135 140 

GGC GAG ACC CCC GGC TCT GGC TGC GAT CTC TAC AGC CTT GCC GGC ATA 4 80 

Gly Glu Thr Pro Gly Ser Gly Cys Asp Leu Tyr Ser Leu Ala Gly He 
145 150 155 160 

TAC TAC TAC TTG GTT ACC GGG TTA AGC CCG CCA CGC GAC CCA AAA GAG 528 
Tyr Tyr Tyr Leu Val Thr Gly Leu Ser Pro Pro Arg Asp Pro Lys Glu 
165 170 175 

TTC GCC AAG GCG CTC TCG TTG GCT CCC GCT CCA AGT AGC CTC TTG GAA 576 
Phe Ala Lys Ala Leu Ser Leu Ala Pro Ala Pro Ser Ser Leu Leu Glu 
180 185 190 

CTG TTC ACA CAG CTG GTG CTG GAT CCC GAG TAT CGT AAC AGC CTT GAT 624 
Leu Phe Thr Gin Leu Val Leu Asp Pro Glu Tyr Arg Asn Ser Leu Asp 
195 200 205 

CCT CTC CAG CTG TTG AAG ATT GTT GCA TCT TTT AAC CCG CAA CTG CTA 672 
Pro Leu Gin Leu Leu Lys He Val Ala Ser Phe Asn Pro Gin Leu Leu 
210 215 220 

GTC CCT CAT ATC GTT ATA GAT GGT GTT TAC AAG CCG CTA GGT TAC GGC 720 
Val Pro His He Val He Asp Gly Val Tyr Lys Pro Leu Gly Tyr Gly 
225 230 235 240 

GAG GTA AGC ATA GGC TCT AGA GGC GTT ATA CGT GTT GAT GGA CGA CCA 768 
Glu Val Ser He Gly Ser Arg Gly Val He Arg Val Asp Gly Arg Pro 
245 250 255 

GTG TAC CTC GCG GTT AAG AGG CAT GTG AGG GGC ACA AGT ATG TAC GCG 816 
Val Tyr Leu Ala Val Lys Arg His Val Arg Gly Thr Ser Met Tyr Ala 
260 265 270 

TAT ACG GAT CTT GTC GTG TTT AGG AGA GGC GAG AAA CTC ATA GTG AGA 864 
Tyr Thr Asp Leu Val Val Phe Arg Arg Gly Glu Lys Leu He Val Arg 
275 280 285 



AGC 
Ser 



GOT GAG AGT ATA GAC CTA GAG TTT 
Gly Glu Ser He Asp Leu Glu Phe 
290 295 



AAC GAC CTG GTG TTG TTC GAC 912 
Asn Asp Leu Val Leu Phe Asp 
300 



AAC CAC ATA CTA TAC GTA TTT ATC CTT CCG GAA AGG CCC 951 
Asn His He Leu Tyr Val Phe He Leu Pro Glu Arg Pro 
305 310 315 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 317 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 



Met 


Thr 


Leu 


Leu 


Ala 


Leu 


Tyr 


Gin Asn 


Lys 


Arg 


Val 


He 


Val 


Lys 


Leu 


1 








5 






10 










15 




Glv 


Tro 


Glv 


Ser 


Gly 


Thr 


Ser 


Gin He 


Thr 


Asn 


Glu 


Ala 


Gin 


Val 


Leu 


20 








25 










30 






Ser 


Val 


Leu 


His 


ASD 


Met 


Pro 


He Val 


Pro Arg 


Leu 


His 


Thr 


Arg 


Leu 






35 








40 








45 








Asp 


Leu 


Asp 


Asp 


Val 


Lvs 


Leu 


Val Ala 


He 


Glu 


Tyr 


He 


Pro 


Tyr 


Lys 


50 






55 








60 










Ser 


Leu 


Asn 


Ala 


Val 


Gly 


ArQ 


Leu Asn 


Pro 


Leu 


Lys 


Ala 


Val 


Thr 


Ala 


65 










70 








75 










80 


Val 


Phe 


Tyr 


Thr 


Leu 


Ala 


Ser 


Leu Val 


His 


He 


His 


Gly 


Arg 


Gly 


Phe 








85 








90 










95 




Ala 


His 


Cys 


Asp 


Leu 


Lys 


Pro 


Gly Asn 


Val 


He 


Pro 


Val 


Pro 


Lys 


Arg 






100 








105 










110 






Gly 


Met 


Val 


Phe 


He 


Asp 


Phe 


Gly Val 


Ala 


Arg 


Pro 


Phe 


Asp 


Ala 


Ala 




115 










120 








125 








Gly 


Phe 


Ala 


Ala 


Gly 


Thr 


Pro 


Gly Tyr 


Thr 


Cys 


Pro 


Glu 


Ala 


Leu 


Gly 


130 










135 








140 










Gly 


Glu 


Thr 


Pro 


Gly 


Ser 


Gly 


Cys Asp 


Leu Tyr 


Ser 


Leu 


Ala 


Gly 


He 


145 










150 








155 










160 


Tyr 


Tyr 


Tyr 


Leu 


Val 


Thr 


Gly 


Leu Ser 


Pro 


Pro 


Arg 


Asp 


Pro 


Lys 


Glu 




165 








170 










175 




Phe 


Ala 


Lys 


Ala 


Leu 


Ser 


Leu 


Ala Pro 


Ala 


Pro 


Ser 


Ser 


Leu 


Leu 


Glu 






180 








185 










190 






Leu 


Phe 


Thr 
195 


Gin 


Leu 


Val 


Leu 


Asp Pro 
200 


Glu 


Tyr 


Arg 


Asn 
205 


Ser 


Leu 


Asp 


Pro 


Leu 


Gin 


Leu 


Leu 


Lys 


He 


Val Ala 


Ser 


Phe 


Asn 


Pro 


Gin 


Leu 


Leu 




210 








215 








220 










Val 


Pro 


His 


He 


Val 


He 


Asp 


Gly Val 


Tyr 


Lys 


Pro 


Leu 


Gly 


Tyr 


Gly 


225 










230 






235 










240 


Glu 


Val 


Ser 


He 


Gly 
245 


Ser 


Arg 


Gly Val 


He 
250 


Arg 


Val 


Asp 


Gly 


Arg 
255 


Pro 


Val 


Tyr 


Leu 


Ala 


Val 


Lys 


Arg 


His Val 


Arg Gly 


Thr 


Ser 


Met 


Tyr 


Ala 






260 






265 










270 






Tyr 


Thr 


Asp 


Leu 


Val 


Val 


Phe 


Arg Arg 


Gly Glu 


Lys 


Leu 


He 


Val 


Arg 




275 










280 








285 








Ser 


Gly 
290 


Glu 


Ser 


He 


Asp 


Leu 
295 


Glu Phe 


Asn 


Asp 


Leu 
300 


Val 


Leu 


Phe 


Asp 


Asn 


His 


He 


Leu 


Tyr 


Val 


Phe 


He Leu 


Pro 


Glu 


Arg 


Pro 









305 310 315 



(2) INFORMATION FOR SEQ ID NO: 45: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1320 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...1320 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:45: 

ATG GAC ATC AGG GCC GTT GTT TTT GAC CTC GAC GGG ACG CTT GTG GGT 4 8 

Met Asp lie Arg Ala Val Val Phe Asp Leu Asp Gly Thr Leu Val Gly 
15 10 15 

GCT GAG AAG ACT TTC AGC GAG ATA AAG TCC GAG CTT AAA GAA CGG CTG 96 
Ala Glu Lys Thr Phe Ser Glu lie Lys Ser Glu Leu Lys Glu Arg Leu 
20 25 30 

ATT TCC TTA GGG ATT CCC AGG GAG CTC GTT GGA GAG CTA ACG CCG ATG 144 
lie Ser Leu Gly lie Pro Arg Glu Leu Val Gly Glu Leu Thr Pro Met 
35 40 45 

TAT GAG GGC CTT ATC GAG CTG TCC AGA AAA ACG GGC AGA CCT TTC GAA 192 
Tyr Glu Gly Leu lie Glu Leu Ser Arg Lys Thr Gly Arg Pro Phe Glu 
50 55 60 

GAG ATG TAC TCA- ATT CTC GTC AAT CTT GAA GTT GAA AGG ATA AGG GAC 24 0 

Glu Met Tyr Ser lie Leu Val Asn Leu Glu Val Glu Arg lie Arg Asp 
65 70 75 80 

AGC TTT CTC TTC GAG GGG GCA AGG GAG CTC CTC GAC TTT CTT GTG GGG 288 
Ser Phe Leu Phe Glu Gly Ala Arg Glu Leu Leu Asp Phe Leu Val Gly 
85 90 95 

GAG GGA ATA AAG CTT GCC CTC ATG ACC CGG AGC TCC AGA ATG GCT GCC 33 6 

Glu Gly lie Lys Leu Ala Leu Met Thr Arg Ser Ser Arg Met Ala Ala 
100 105 110 

CTT GAG GCC CTG GAG CTT CAC GGC ATT AAG GAC TAC TTT GAG ATT ATT 384 
Leu Glu Ala Leu Glu Leu His Gly lie Lys Asp Tyr Phe Glu lie lie 
115 120 125 

TCA ACG AGG GAT GAT GTC CCT CCC GAG GAG CTG AAA CCG AAT CCT GGC 432 
Ser Thr Arg Asp Asp Val Pro Pro Glu Glu Leu Lys Pro Asn Pro Gly 
130 135 140 

CAG CTG AGG AGA ATC CTC GGT GAG CTC AAC GTT CAA CCA GAG AAA GCC 480 
Gin Leu Arg Arg lie Leu Gly Glu Leu Asn Val Gin Pro Glu Lys Ala 
145 150 155 160 

ATC GTC GTT GGA GAC CAC GGC TAC GAT GTC ATC CCT GCC CGG GAG CTC 52 8 

He Val Val Gly Asp His Gly Tyr Asp Val He Pro Ala Arg Glu Leu 
165 170 175 

GGC GCT CTG AGC GTC CTT GTC ACC GGC CAC GAG GCT GGC AGA ATG AGC 576 
Gly Ala Leu Ser Val Leu Val Thr Gly His Glu Ala Gly Arg Met Ser 
180 185 190 



TTT CAG GTT GAA GCC GAG CCA AAC TTT GAG GTC GAG AAC CTC ATT CAC 
Phe Gin Val Glu Ala Glu Pro Asn Phe Glu Val Glu Asn Leu He His 
195 200 205 



624 



CTC AGG AAG CTC TTC GAG AGG CTC CTG TCG AGC TAG GTT GTT GTT CCC 672 
Leu Arg Lys Leu Phe Glu Arg Leu Leu Ser Ser Tyr Val Val Val Pro 
210 215 220 

GOT TAG AAC GAG GAG AAG ACC ATC AAG GGG GTA ATA GAG AAT CTT CTC 720 
Ala Tyr Asn Glu Glu Lys Thr lie Lys Gly Val lie Glu Asn Leu Leu 
225 230 235 240 

AGG TAT TTC AAA AAG GAC GAG ATA ATC GTC GTG AAC GAG GGC TCC AGG 76 8 

Arg Tyr Phe Lys Lys Asp Glu lie lie Val Val Asn Asp Gly Ser Arg 
245 250 255 

GAT AGA ACG GAG GAG ATA GCT CGT TCT TAG GGA GTC CAC GTT CTT ACG 816 
Asp Arg Thr Glu Glu lie Ala Arg Ser Tyr Gly Val His Val Leu Thr 
260 265 270 

CAT CTC GTC AAC AGG GGG CTT GGT GGG GCC CTC GGA ACG GGC TTT GCC 864 
His Leu Val Asn Arg Gly Leu Gly Gly Ala Leu Gly Thr Gly Phe Ala 
275 280 285 

TAT GCC ATC AGA AAA AAC GCC AAA CTT GTC CTC ACA TTT GAT GCC GAC 912 
Tyr Ala lie Arg Lys Asn Ala Lys Leu Val Leu Thr Phe Asp Ala Asp 
290 295 300 

GGC CAG CAC CTT ATA AGC GAC GCC CTC CGC GTC ATG AGG CCA GTT GCG 960 
Gly Gin His Leu lie Ser Asp Ala Leu Arg Val Met Arg Pro Val Ala 
305 310 315 320 

GAG GGC AGG GCG GAC TTT GCG GTC GGC TCA AGG CTC AAA GGT GAC ACG 1008 
Glu Gly Arg Ala Asp Phe Ala Val Gly Ser Arg Leu Lys Gly Asp Thr 
325 330 335 

AGC CAG ATG CCC CTC GTG AAG AAG TTC GGC AAC TTC GTT CTA GAT GCC 1056 
Ser Gin Met Pro Leu Val Lys Lys Phe Gly Asn Phe Val Leu Asp Ala 
340 345 350 

GTG ACC GCG GTT TTT GCT GGT AAA TAG GTC AGC GAC AGT CAG AGC GGG 1104 
Val Thr Ala Val Phe Ala Gly Lys Tyr Val Ser Asp Ser Gin Ser Gly 
355 360 365 

TTA AGG TGT CTA AGC GGC GAC TGC CTG AGG AAA ATC AGG ATA ACC TGC 1152 
Leu Arg Cys Leu Ser Gly Asp Cys Leu Arg Lys lie Arg lie Thr Cys 
370 375 380 

GAC CGC TAT GCC GTG TCG AGT GAG ATT ATA ATA GAG GCC TCC AAA GCG 1200 
Asp Arg Tyr Ala Val Ser Ser Glu lie lie He Glu Ala Ser Lys Ala 
385 390 395 400 

GGC TGT AGA ATT GTC GAA GTT CCT ATC AAG GCT GTT TAG ACT GAG TAG 1248 
Gly Cys Arg He Val Glu Val Pro He Lys Ala Val Tyr Thr Glu Tyr 
405 410 415 

TTT ATG AAG AAG GGG ACG AAC GTT TTA GAG GGC GTT AAG ATA GCC CTG 12 96 
Phe Met Lys Lys Gly Thr Asn Val Leu Glu Gly Val Lys He Ala Leu 
420 425 430 

AAC CTT CTC TTT GAC AAA CTG AGG 132 0 

Asn Leu Leu Phe Asp Lys Leu Arg 
435 440 

(2) INFORMATION FOR SEQ ID NO: 46: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 44 0 amino acids 



(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 



Met Asp 


He 


Arg 


Ala 


Val 


Val 


Phe 


Asp 


Leu 


Asp 


Gly 


Thr 


Leu 


Val 


Gly 


1 






5 










10 










15 




Ala Glu 


Lvs 


Thr 
20 


Phe 


Ser 


Glu 


He 


Lys 
25 


Ser 


Glu 


Leu 


Lvs 


Glu 
30 


Arcr 


Leu 


lie Ser 


Leu 
35 


Gly 


He 


Pro 


Arq 


Glu 
40 


Leu 


Val 


Gly 


Glu 


Leu 

45 


Thr 


Pro 


Met 


Tyr Glu 


Gly 


Leu 


He 


Glu 


Leu 


Ser Arg 


Lys 

■I 


Thr 


Gly 


Arg 


Pro 


Phe 


Glu 


50 










55 










60 










Glu Met 


Tyr 


Ser 


He 


Leu 


Val 


Asn 


Leu 


Glu 


Val 


Glu 


Arg 


He 


Arg 


Asp 


65 








70 










75 










80 


Ser Phe 


Leu 


Phe 


Glu 


Gly 


Ala 


Arg Glu 


Leu 


Leu 


Asp 


Phe 


Leu 


Val 


Gly 








85 










90 










95 


Glu Gly 


He 


Lys 
100 


Leu 


Ala 


Leu 


Met 


Thr 
105 


Arg 


Ser 


Ser 


Arg 


Met 
110 


Ala 


Ala 


Leu Glu 


Ala 
115 


Leu 


Glu 


Leu 


His 


Gly 
120' 


He 


Lys 


Asp 


Tyr 


Phe 
125 


Glu 


He 


He 


Ser Thr 


Arg 


Asp 


Asp 


Val 


Pro 


Pro 


Glu 


Glu 


Leu 


Lys 


Pro 


Asn 


Pro 


Gly 


130 










135 










140 










Gin Leu 


Arg 


Arg 


He 


Leu^ 


Gly 


Glu 


Leu 


Asn 


Val 


Gin 


Pro 


Glu 


Lys 


Ala 


145 








150 










155 










160 


He Val 


Val 


Gly 


Asp 


His 


Gly 


Tyr Asp 


Val 


He 


Pro 


Ala 


Arg 


Glu 


Leu 








165 










170 










175 




Gly Ala 


Leu 


Ser 


Val 


Leu 


Val 


Thr Gly 


His 


Glu 


Ala 


Gly 


Arg 


Met 


Ser 






180 










185 










190 






Phe Gin 


Val 
195 


Glu 


Ala 


Glu 


Pro 


Asn 
200 


Phe 


Glu 


Val 


Glu 


Asn 
205 


Leu 


He 


His 


Leu Arg 


Lys 


Leu 


Phe 


Glu 


Arg 


Leu 


Leu 


Ser 


Ser 


Tyr 


Val 


Val 


Val 


Pro 


210 










215 










220 










Ala Tyr 


Asn 


Glu 


Glu 


Lys 


Thr 


He 


Lys 


Gly 


Val 


He 


Glu 


Asn 


Leu 


Leu 


225 








230 










235 










240 


Arg Tyr 


Phe 


Lys 


Lys 
245 


Asp 


Glu 


He 


He 


Val 
250 


Val 


Asn 


Asp 


Gly 


Ser 
255 


Arg 


Asp Arg 


Thr 


Glu 
260 


Glu 


He 


Ala 


Arg 


Ser 
265 


Tvr 


Gly 


Val 


His 


Val 
270 


Leu 


Thr 


His Leu 


Val 


Asn 


Arg 


Gly 


Leu 


Gly Gly 


Ala 


Leu 


Gly 


Thr 


Gly 


Phe 


Ala 




275 










280 










285 








Tyr Ala 


He 


Arg 


Lys 


Asn 


Ala 


Lys 


Leu 


Val 


Leu 


Thr 


Phe 


Asp 


Ala 


Asp 


290 










295 










300 










Gly Gin 


His 


Leu 


He 


Ser 


Asp 


Ala 


Leu 


Arg 


Val 


Met 


Arg 


Pro 


Val 


Ala 


305 








310 










315 










320 


Glu Gly 


Arg 


Ala 


Asp 
325 


Phe 


Ala 


Val 


Gly 


Ser 
330 


Arg 


Leu 


Lys 


Gly 


Asp 
335 


Thr 


Ser Gin 


Met 


Pro 
340 


Leu 


Val 


Lys 


Lys 


Phe 
345 


Gly 


Asn 


Phe 


Val 


Leu 
350 


Asp 


Ala 


Val Thr 


Ala 
355 


Val 


Phe 


Ala 


Gly 


Lys 
360 


Tyr 


Val 


Ser 


Asp 


Ser 
365 


Gin 


Ser 


Gly 


Leu Arg 


Cys 


Leu 


Ser 


Gly 


Asp 


Cys 


Leu 


Arg 


Lys 


He 


Arg 


He 


Thr 


Cys 


370 










375 










380 










Asp Arg 


Tyr 


Ala 


Val 


Ser 


Ser 


Glu 


He 


He 


He 


Glu 


Ala 


Ser 


Lys 


Ala 


385 








390 










395 










400 


Gly Cys 


Arg 


He 


Val 
405 


Glu 


Val 


Pro 


He 


Lys 
410 


Ala 


Val 


Tyr 


Thr 


Glu 
415 


Tyr 


Phe Met 


Lys 


Lys 
420 


Gly 


Thr 


Asn 


Val 


Leu 
425 


Glu 


Gly 


Val 


Lys 


He 
430 


Ala 


Leu 


Asn Leu 


Leu 
435 


Phe 


Asp 


Lys 


Leu 


Arg 
440 



















(2) INFORMATION FOR SEQ ID NO: 47: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 95 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1. . .795 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

ATG GAA AAT CTT GAA AAA CTC CTT GAA GTG GCA AAG ATG GCA GCC CTT 4 8 

Met Glu Asn Leu Glu Lys Leu Leu Glu Val Ala Lys Met Ala Ala Leu 
15 10 15 

GCC GGA GGA CAG GTA TTA AAG GAA AAC TTC GGA AAG ATT AAG CTT GAA 96 
Ala Gly Gly Gin Val Leu Lys Glu Asn Phe Gly Lys lie Lys Leu Glu 
20 25 30 

AAC ATT GAA GAA AAG GGA GAG AAG GAC TTC GTG AGC TAC GTT GAT AAA 144 
Asn lie Glu Glu Lys Gly Glu Lys Asp Phe Val Ser Tyr Val Asp Lys 
35 40 45 

ACC TCC GAA GAG AGA ATA AAA GAG CTA ATA CTT AAG TTC TTT CCC GAC 192 
Thr Ser Glu Glu Arg lie Lys Glu Leu lie Leu Lys Phe Phe' Pro Asp 
50 55 60 

CAC GAG GTC GTG GGG GAG GAA AGG GGA AAG GAG GGA AAA GAA AGC CCT 240 
His Glu Val Val Gly Glu Glu Arg Gly Lys Glu Gly Lys Glu Ser Pro 
65 70 75 80 

TAC AAA TGG TTC ATA GAC CCC CTT GAT GGG ACC AAG AAC TAC ATA AAG 288 
Tyr Lys Trp Phe lie Asp Pro Leu Asp Gly Thr Lys Asn Tyr lie Lys 
85 90 95 

GGC TTT CCC ATA TTT GCA GTC TCC GTC GGA CTC GTT AAG GAA AAC GAA 336 
Gly Phe Pro lie Phe Ala Val Ser Val Gly Leu Val Lys Glu Asn Glu 
100 105 110 

CCT ATA GTG GGA GCG GTT TAC CTT CCT TAC TTT GAT ACC CTA TAC TGG 3 84 

Pro He Val Gly Ala Val Tyr Leu Pro Tyr Phe Asp Thr Leu Tyr Trp 
115 120 125 

GCT TCA AAG GGA AGG GGA GCC TAT AAA AAC GGG GAG AGG ATA AGC GTA 432 
Ala Ser Lys Gly Arg Gly Ala Tyr Lys Asn Gly Glu Arg He Ser Val 
130 135 140 

AAG GAA AGG GGG GAG CTC AAG CAC GCG GCG GTT GTT TAC GGA TTT CCA 4 80 

Lys Glu Arg Gly Glu Leu Lys His Ala Ala Val Val Tyr Gly Phe Pro 
145 150 155 160 

TCA AGA AGC AGG AGG GAT ATA TCT CTT TAC CTG AAT GTG TTT AAA GAG 528 
Ser Arg Ser Arg Arg Asp He Ser Leu Tyr Leu Asn Val Phe Lys Glu 
165 170 175 

GTC TTT TAC GAA GTA GGT TCC GTT AGG AGG CCC GGG GCC GCA GCG GTT 576 
Val Phe Tyr Glu Val Gly Ser Val Arg Arg Pro Gly Ala Ala Ala Val 
180 185 190 



GAT ATA TGC ATG CTT GCG GAG GGC ATA TTT GAC GGG ATG ATG GAG TTT 624 
Asp He Cys Met Leu Ala Glu Gly He Phe Asp Gly Met Met Glu Phe 
195 200 205 

GAG ATG AAG CCA TGG GAC ATA ACG GCG GGA CTC GTA ATA CTG AAG GAA 672 
Glu Met Lys Pro Trp Asp He Thr Ala Gly Leu Val He Leu Lys Glu 
210 215 220 

GCT GGA GGA TTT TAC ACA CTG AAG GGA GAC CCC TTC GGC ATC TCG GAC 72 0 

Ala Gly Gly Phe Tyr Thr Leu Lys Gly Asp Pro Phe Gly He Ser Asp 
225 230 235 240 



ATA ATA GCG GGA AAC AGG ATG CTC CAC 
He He Ala Gly Asn Arg Met Leu His 
245 

AAT AAA TAC ATG AAT AAT GAA AGC ACG 
Asn Lys Tyr Met Asn Asn Glu Ser Thr 
260 265 



GAC TTC ATT CTC AAG GTT GTG 768 
Asp Phe He Leu Lys Val Val 
250 255 

795 



(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 265 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 



Met 


Glu 


Asn 


Leu 


Glu 


Lys 


Leu 


Leu 


Glu 


Val Ala Lys Met 


Ala 


Ala 


Leu 


1 








5 








10 




15 




Ala Gly 


Gly 


Gin 


Val 


Leu 


Lys 


Glu 


Asn 


Phe Gly Lys He 


Lys 


Leu 


Glu 






20 










25 




30 






Asn 


He 


Glu 


Glu 


Lys 


Gly 


Glu 


Lys 


Asp 


Phe Val Ser Tyr 


Val 


Asp 


Lys 






35 








40 




45 








Thr 


Ser 


Glu 


Glu 


Arg 


He 


Lys 


Glu 


Leu 


He Leu Lys Phe 


Phe 


Pro 


Asp 




50 








55 






60 








His 


Glu 


Val 


Val 


Gly 


Glu 


Glu 


Arg 


Gly 


Lys Glu Gly Lys 


Glu 


Ser 


Pro 


65 








70 








75 






80 


Tyr 


Lys 


Trp 


Phe 


He 


Asp 


Pro 


Leu 


Asp 


Gly Thr Lys Asn 


Tyr 


He 


Lys 




85 










90 




95 




Glv Phe 


Pro 


He 


Phe 


Ala 


Val 


Ser 


Val 


Gly Leu Val Lys 


Glu 


Asn 


Glu 








100 










105 




110 






Pro 


He 


Val 


Gly 


Ala 


val 


Tyr 


Leu 


Pro 


Tyr Phe Asp Thr 


Leu 


Tyr 


Trp 






115 








120 




125 






Val 


Ala 


Ser 


Lys 


Gly 


Arg 


Gly 


Ala 


Tyr 


Lys 


Asn Gly Glu Arg 


He 


Ser 




130 




135 






140 








Lys 


Glu 


Arg 


Gly 


Glu 


Leu 


Lys 


His 


Ala 


Ala Val Val Tyr Gly 


Phe 


Pro 


145 






150 








155 






160 


Ser 


Arg 


Ser 


Arg 


Arg 


Asp 


He 


Ser 


Leu 


Tyr Leu Asn Val 


Phe 


Lys 


Glu 






165 










170 




175 


Val 


Val 


Phe 


Tyr 


Glu 


Val 


Gly 


Ser 


Val 


Arg 


Arg Pro Gly Ala Ala 


Ala 






180 










185 




190 






Asp 


He 


Cys 


Met 


Leu 


Ala 


Glu 


Gly 


He 


Phe Asp Gly Met 


Met 


Glu 


Phe 




195 










200 




205 






Glu 


Glu 


Met 


Lys 


Pro 


Trp 


Asp 


He 


Thr 


Ala 


Gly Leu Val He 


Leu 


Lys 




210 








215 






220 








Ala Gly 


Gly 


Phe 


Tyr 


Thr 


Leu 


Lys 


Gly 


Asp Pro Phe Gly 


He 


Ser 


Asp 


225 








230 








235 






240 


He 


He 


Ala 


Gly 


Asn 


Arg 


Met 


Leu 


His 


Asp Phe He Leu 


Lys 


Val 


val 








245 










250 




255 





Asn Lys Tyr Met Asn Asn Glu Ser Thr 
260 265 



(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 591 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...591 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49: 

ATG AGT GAA CAG CCG GTA TTG TCT GTT CAA GGA TTA AGC GGC GGG TAT 48 
Met Ser Glu Gin Pro Val Leu Ser Val Gin Gly Leu Ser Gly Gly Tyr 
15 10 15 

AGC ATG AAC CGA CCG GTT CTG CAT GAC GTA ACC TTT CAG GTT GAA CCG 96 
Ser Met Asn Arg Pro Val Leu His Asp Val Thr Phe Gin Val Glu Pro 
20 25 30 

GGT GAG ATG GTG GGT TTG ATC GGC CTG AAC GGT GCG GGC AAG AGT ACC 144 
Gly Glu Met Val Gly Leu lie Gly Leu Asn Gly Ala Gly Lys Ser Thr 
35 40 45 

ACG ATG AAG CAT ATT CTC GGG CTG ATG AAT CCG CAA AAA GGG AGC ATT 192 
Thr Met Lys His lie Leu Gly Leu Met Asn Pro Gin Lys Gly Ser lie 
50 55 60 

CAG GTT CAA GGA AAG AGC CGG ACA GAG CAT TCG GAA GCC TAT CAC GGC 240 
Gin Val Gin Gly Lys Ser Arg Thr Glu His Ser Glu Ala Tyr His Gly 
65 70 75 80 

GCC TTG GCG TTT GTT CCC GAA TCC CCG CTG CTG TAT GAG GAG ATG ACA 288 
Ala Leu Ala Phe Val Pro Glu Ser Pro Leu Leu Tyr Glu Glu Met Thr 
85 90 95 

GTA CGA GAG CAT CTG GAA TTT ACG GCG CGC TCC TAT GGC GTA TCC CGT 336 
Val Arg Glu His Leu Glu Phe Thr Ala Arg Ser Tyr Gly Val Ser Arg 
100 105 110 

GAA GAT TAT GAG GCA CGT TCG GAG CAG CTG TCG AAG ATG TTC CGT ATG 384 
Glu Asp Tyr Glu Ala Arg Ser Glu Gin Leu Ser Lys Met Phe Arg Met 
115 120 125 

GAA GAG AAG ATG GAC AGC CTG TCC ACG CAT TTG TCC AAA GGG ATG CGC 432 
Glu Glu Lys Met Asp Ser Leu Ser Thr His Leu Ser Lys Gly Met Arg 
130 135 140 

CAA AAA GTG ATG ATC ATG TGC GCA TTC GTA GCC AGA CCG TCC CTG TAC 480 
Gin Lys Val Met He Met Cys Ala Phe Val Ala Arg Pro Ser Leu Tyr 
145 150 155 160 

ATC ATT GAC GAG CCC TTT CTT GGG CTT GAT CCG CTT GGG ATA CGC TCG 52 8 

He He Asp Glu Pro Phe Leu Gly Leu Asp Pro Leu Gly He Arg Ser 
165 170 175 



CTG CTT GAC TTC ATG CTG GAG CTG AAG GCA TCC GGC GCT TCG GTA TTG 
Leu Leu Asp Phe Met Leu Glu Leu Lys Ala Ser Gly Ala Ser Val Leu 



576 



180 



185 



190 



CTA AGC TCC CAC ATT 
Leu Ser Ser His lie 
195 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 197 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 





(xi) SEQUENCE 


DESCRIPTION: 


: SEQ ID 


NO: 50 : 






Met 


Ser 


Glu 


Gin 


Pro 


Val 


Leu 


Ser 


Val 


Gin 


Gly Leu 


Ser Gly Gly Tyr 










5 










10 




15 




Ser 


Met 


Asn 


Arg 


Pro 


Val 


Leu 


His 


Asp 


Val 


Thr Phe 


Gin Val Glu 


Pro 








20 










25 






30 




Gly 


Glu 


Met 


Val 


Gly 


Leu 


He 


Gly 


Leu 


Asn 


Gly Ala 


Gly Lys Ser 


Thr 




35 










40 








45 




Thr 


Met 


Lys 


His 


He 


Leu 


Gly 


Leu 


Met 


Asn 


Pro Gin 


Lys Gly Ser 


He 




50 








55 








60 






Gin 


Val 


Gin 


Gly 


Lys 


Ser 


Arg 


Thr 


Glu 


His 


Ser Glu 


Ala Tyr His 


Gly 


65 






70 










75 




80 


Ala 


Leu 


Ala 


Phe 


Val 


Pro 


Glu 


Ser 


Pro 


Leu 


Leu Tyr 


Glu Glu Met 


Thr 










85 










90 




95 




Val 


Arg 


Glu 


His 


Leu 


Glu 


Phe 


Thr 


Ala 


Arg 


Ser Tyr 


Gly Val Ser 


Arg 






100 










105 






110 




Glu 


Asp 


Tyr 


Glu 


Ala 


Arg 


Ser 


Glu 


Gin 


Leu 


Ser Lys 


Met Phe Arg 


Met 




115 










120 








125 




Glu 


Glu 


Lys 


Met 


Asp 


Ser 


Leu 


Ser 


Thr 


His 


Leu Ser 


Lys Gly Met 


Arg 




130 








135 








140 






Gin 


Lys 


Val 


Met 


He 


Met 


Cys 


Ala 


Phe 


Val 


Ala Arg 


Pro Ser Leu 


Tyr 


145 








150 










155 




160 


He 


He 


Asp 


Glu 


Pro 


Phe 


Leu 


Gly 


Leu 


Asp 


Pro Leu 


Gly He Arg 


Ser 








165 










170 




175 




Leu 


Leu 


Asp 


Phe 


Met 


Leu 


Glu 


Leu 


Lys 


Ala 


Ser Gly 


Ala Ser Val 


Leu 






180 










185 






190 




Leu 


Ser 


Ser 


His 


He 






















195 























(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 82 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(ix) FEATURE: . 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: l..,1482 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 



ATG AAG AAA ATA ACT ATT AGT AGT TTG CTT CTA CTT TTA CTT ATT TCT 
Met Lys Lys He Thr He Ser Ser Leu Leu Leu Leu Leu Leu He Ser 
15 10 15 



ACC AAT TTG AAT CTC GCA TAG GAT TCC CAA GAG AGO GGT ATT AAA AAT 96 
Thr Asn Leu Asn Leu Ala Tyr Asp Ser Gin Glu Ser Gly lie Lys Asn 
20 25 30 

ATA ATA ATC CTC ATT GGA GAC GGC ATG GGA ATG AGT CAT GTC CAG ATT 144 
He He He Leu He Gly Asp Gly Met Gly Met Ser His- Val Gin He 
35 40 45 

ACA AAG CTT GTT TAT GGT CAT CTA AAC ATG GAA GAG TTC CCA ATT ATT 192 
Thr Lys Leu Val Tyr Gly His Leu Asn Met Glu Glu Phe Pro He He 
50 55 60 

GGA TTC GAA CTT ACT GAG TCA TTA AGT GGG GAA GTT ACG GAC TCC GCT 24 0 

Gly Phe Glu Leu Thr Glu Ser Leu Ser Gly Glu Val Thr Asp Ser Ala 
65 70 75 80 

GCA GCA GGA ACT GCA ATA GCA ACT GGA GTC AAA ACA TAT AAT CGA ATG 2 88 

Ala Ala Gly Thr Ala He Ala Thr Gly Val Lys Thr Tyr Asn Arg Met 
85 90 95 



ATT TCA GTT ACT AAC ATA ACT GGA AAA GTT ACA AAT CTA ACT ACC TTG 
He Ser Val Thr Asn He Thr Gly Lys Val Thr Asn Leu Thr Thr Leu 
100 105 110 



336 



CTT GAA ATA GCC CAG GTA CTT GGA AAA TCA ACT GGA CTT GTG ACT ACT 384 
Leu Glu He Ala Gin Val Leu Gly Lys Ser Thr Gly Leu Val Thr Thr 
115 120 125 

ACT AGA ATT ACA CAC GCA ACC CCT GCA GTA TTT GCT TCC CAC GTT CCT 432 
Thr Arg He Thr His Ala Thr Pro Ala Val Phe Ala Ser His Val Pro 
130 135 140 

GAC AGA GAT ATG GAA GAG GAA ATA GCG AGA CAG CTC ATA GCT CAC CGG 4 80 

Asp Arg Asp Met Glu Glu Glu He Ala Arg Gin Leu He Ala His Arg 
145 150 155 160 

GTC AAC GTC CTA TTA GGT GGA GGG AGA AAG AAA TTT GAC GAG AAT ACC 52 8 

Val Asn Val Leu Leu Gly Gly Gly Arg Lys Lys Phe Asp Glu Asn Thr 
165 170 175 

CTA AAA ATG GCA AAA GAA CAG GGA TAT AAT ATA GTC TTC ACG AAA GAA 576 
Leu Lys Met Ala Lys Glu Gin Gly Tyr Asn He Val Phe Thr Lys Glu 
180 185 190 

GAG CTC GAG AAA GCA GAG GGT GAG TTT ATT CTA GGG CTT TTT GCA GAT 624 
Glu Leu Glu Lys Ala Glu Gly Glu Phe He Leu Gly Leu Phe Ala Asp 
195 200 205 

AGC CAC ATT CCT TAC GTA TTG GAC AGA AAA CCA GAA GAT GTT GGA CTT 672 
Ser His He Pro Tyr Val Leu Asp Arg Lys Pro Glu Asp Val Gly Leu 
210 215 220 

TTG GAA ATG ACT AAA AAA GCA ATT TCA ATA CTA GAG AAA AAT CCA AAT 72 0 

Leu Glu Met Thr Lys Lys Ala lie Ser He Leu Glu Lys Asn Pro Asn 
225 230 235 240 

GGG TTC TTT CTC ATG ATT GAA GGG GGC AGA ATT GAT CAT GCA GCT CAT 768 
Gly Phe Phe Leu Met He Glu Gly Gly Arg He Asp His Ala Ala His 
245 250 255 

GAG AAT GAT ATA GCA TCA GTT GTT GCA GAG ACT AAG GAG TTT GAT GAC 816 
Glu Asn Asp He Ala Ser Val Val Ala Glu Thr Lys Glu Phe Asp Asp 
260 265 270 



GTT GTT GGA TAT GTT CTT GAG TAT GCA AAA AAG AGG GGA GAT ACA CTA 864 
Val Val Gly Tyr Val Leu Glu Tyr Ala Lys Lys Arg Gly Asp Thr Leu 
275 280 285 

GTA ATA GTG CTG GCT GAG CAT GAG ACA GGG GGG CTT GGA TTA GGT CTA 912 
Val lie Val Leu Ala Asp His Glu Thr Gly Gly Leu Gly Leu Gly Leu 
290 . 295 300 

ACA TAT GGA GAT GCA ATT AAT GAA GAT GTC ATC AGG AAC ATA AAC GCT 960 
Thr Tyr Gly Asp Ala lie Asn Glu Asp Val lie Arg Asn lie Asn Ala 
305 310 315 320 

AGT GTG TCG AAA ATT GCT AGT GAA ATA AGG GCA ACG AAT GAC ATA AAG 1008 
Ser Val Ser Lys lie Ala Ser Glu lie Arg Ala Thr Asn Asp lie Lys 
325 330 335 

AGA GTT ATC AAA AAA TAT ACT GGA TTC GAG CTA ACA GAG GAC GAA ATT 1056 
Arg Val lie Lys Lys Tyr Thr Gly Phe Glu Leu Thr Glu Asp Glu He 
340 345 350 

AAT TAC ATT GAG GAA GCT ATA AAC TTA GCA GAC GAA TAT GCG CTT CAA 1104 
Asn Tyr He Glu Glu Ala He Asn Leu Ala Asp Glu Tyr Ala Leu Gin 
355 360 365 

AAT GCA ATA GCT GAT ATT ATA AAC AAA CGC GTT GGT GTA GGT TTT GTA 1152 
Asn Ala He Ala Asp He He Asn Lys Arg Val Gly Val Gly Phe Val 
370 375 380 

TCC CAC AAA CAT ACA GGA GCT CCT GTT TCA CTT CTA GCC TAC GGC CCA 1200 
Ser His Lys His Thr Gly Ala Pro Val Ser Leu Leu Ala Tyr Gly Pro 
385 390 395 400 

GGT GCA GAG AAT TTT GCA GGC TTT TTA CAC CAT GTA GAT ACG GCA AAG 124 8 
Gly Ala Glu Asn Phe Ala Gly Phe Leu His His Val Asp Thr Ala Lys 
405 410 415 

CTA ATT GCC AAG CTA ATG CTC TTT GGG AAG AAA GAT ATT CCC GTT ACC 12 96 
Leu He Ala Lys Leu Met Leu Phe Gly Lys Lys Asp He Pro Val Thr 
420 425 430 

ATC TTG GGA ATA AGT GGA GTT AAA GGA GAT ATA ACC GGA GAC TTC AAA 1344 
He Leu Gly He Ser Gly Val Lys Gly Asp He Thr Gly Asp Phe Lys 
435 440 445 

GTG GAT GAG CAA GAT GCA TAT GTG ACC TTA ATG ATG TTG CTT GGG GAA 13 92 
Val Asp Glu Gin Asp Ala Tyr Val Thr Leu Met Met Leu Leu Gly Glu 
450 455 460 

AGG GTA GAT ACT GAA CTT GAA AGG AAA GTC GAC ATG AAT AAT AAC GGC 1440 
Arg Val Asp Thr Glu Leu Glu Arg Lys Val Asp Met Asn Asn Asn Gly 
465 470 475 480 

ATA ATC GAG TTG GGA GAC GTG CTC CTG ATT CTA CAA GAG TCC 14 82 

He He Glu Leu Gly Asp Val Leu Leu He Leu Gin Glu Ser 
485 490 

(2) INFORMATION FOR SEQ ID NO:52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 94 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(v) FRAGMENT TYPE: internal 





(xi) SEQUENCE 


DESCRIPTION 


Met 


Lys 


Lys 


He 


Thr 
5 


He 


Ser 


Ser 


1 

Thr 


Asn 


Leu 


Asn 


Leu 


Ala 


Tyr 


Asp 








20 










lie 


He 


He 


Leu 


He 


Gly 


Asp 


Gly 






35 










40 


Thr 


Lys 


Leu 


Val 


Tyr 


Gly 


His 


Leu 




50 










55 




Gly 


Phe 


Glu 


Leu 


Thr 


Glu 


Ser 


Leu 












70 






Ala 


Ala 


Gly 


Thr 


Ala 


He 


Ala 


Thr 








85 








lie 


Ser 


Val 


Thr 


Asn 


He 


Thr 


Gly 








100 










Leu 


Glu 


He 


Ala 


Gin 


Val 


Leu 


Gly 






115 










120 


Thr 


Arg 


He 


Thr 


His 


Ala 


Thr 


Pro 




130 










135 




Asp 


Arg 


Asp 


Met 


Glu 


Glu 


Glu 


He 


1 y1 C 

14b 










150 






val 


Asn 


Val 


Leu 


Leu 


Gly 


Gly 


Gly 










165 








Leu 


Lys 


Met 


Ala 


Lys 


Glu 


Gin 


Gly 








180 










Glu 


Leu 


Glu 


Lys 


Ala 


Glu 


Gly 


Glu 






195 










200 


Ser 


His 


He 


Pro 


Tyr 


Val 


Leu 


Asp 




210 










215 




Leu 


Glu 


Met 


Thr 


Lys 


Lys 


Ala 


He 


o o c 










230 






Gly 


Phe 


Phe 


Leu 


Met 


He 


Glu 


Gly 










245 








Glu 


Asn 


Asp 


He 


Ala 


Ser 


Val 


Val 








260 










Val 


Val 


Gly 


Tyr 


Val 


Leu 


Glu 


Tyr 






275 










280 


Val 


He 


Val 


Leu 


Ala 


Asp 


His 


Glu 




290 










295 




Thr 


Tyr 


Gly 


Asp 


Ala 


He 


Asn 


Glu 


"5 A C 

3 05 










310 






Ser 


Val 


Ser 


Lys 


He 


Ala 


Ser 


Glu 










325 








Arg 


Val 


He 


Lys 


Lys 


Tyr 


Thr 


Gly 








340 










Asn 


Tyr 


He 


Glu 


Glu 


Ala 


He 


Asn 




355 










360 


Asn 


Ala 


He 


Ala 


Asp 


He 


He 


Asn 




370 










375 




Ser 


His 


Lys 


His 


Thr 


Gly 


Ala 


Pro 


n o c 
3 85 










390 






Gly 


Ala 


Glu 


Asn 


Phe 


Ala 


Gly 


Phe 








405 








Leu 


He 


Ala 


Lys 


Leu 


Met 


Leu 


Phe 








420 










He 


Leu 


Gly 


He 


Ser 


Gly 


val 


Lys 






435 










440 


Val 


Asp 


Glu 


Gin 


Asp 


Ala 


Tyr 


Val 




450 










455 




Arg 


Val 


Asp 


Thr 


Glu 


Leu 


Glu 


Arg 


465 










470 






He 


He 


Glu 


Leu 


Gly 


Asp 


Val 


Leu 



SEQ ID NO: 52: 



Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Tl 

X xe 


Ser 




lU 










X 3 




Ser 


(jin 


vjIU 


Ser 


Ljiy 


1 le 


Lys 


Asn 


z b 










O U 






Met 


tjiy 


Mec 


Ser 


His 


vai 


oin 


Tl » 

lie 








4b 








Asn 


Met 


CjIU 


(jlU 


T) A 

pne 


Pro 


T 1 A 

lie 


T 1 0 
lie 








o u 










Ser 


Gly 


(ilU 


vai 


rpl_ „ 

inr 


Asp 


Ser 


Ala 

Aia 
















fiO 
0 w 


vjiy 


vai 


Lys 


inr 


Tyr 


Asn 


Arg 


Mec 




on 














Lys 


vai 


inr 


Asn 


Leu 


inr 


inr 


Leu 


xUb 










X X V 






Lys 


Ser 


inr 


vjiy 


Leu 


vai 


inr 


inr 










1 9 R 

X ^ ^ 










vci± 


file 


an a 

A±a 


Ser 


flXo 


\7a 1 
VaX 




















AX a 


■ 

Arg 


v7xn 


Leu 


X xe 


Axa 


fix S 


Arg 






X ^ ^ 










160 


Arg 


Lys 


Lys 


Pile 


Asp 


VjrX U, 


Asn 


1 nx 














175 




Tyr 


Asn 


Tl o 


V aX 


irnc 


X XIX 


Lys 


ni 11 

VJX LI 


X O 3 










1 QO 
X ^ u 






TJ V» A 

fne 


lie 


Leu 


vjiy 


T 

Leu 


pne 


al a 

AX a 


Asp 










90R 

4!« V 3 








Arg 


Lys 


Pro 


VjxU 


Asp 


V aX 


oxy 


_ 

Leu 








^ ^ V 










Ser 


lie 


Leu 


olU 


Lys 


Asn 


Pro 


Asn 






9 7 
Z J 3 










9 AO 


Gly 


Arg 


T 1 A 

lie 


Asp 


HIS 


Aia 


A 1 13 

Aia 


flxS 




o c n 
z b U 










9 i; 




Ala 


(jIU 


inr 


Lys 


^1 11 

(jIU 


T) V» A 

pne 


Asp 


Asp 


O £ c: 
^ Ob 










9 70 






TV T ^ 

Ala 


Lys 


Lys 


Arg 


vjiy 


Asp 


inr 


Leu 










^ O 3 








inr 


Vjriy 


Cjiy 


Leu 


(jrXy 


Leu 


Vjxy 


Leu 








T n A 










ASp 


vai 


Tl A 

lie 


Arg 


Asn 


Tl ^ 

lie 


Asn 


al a 
AX a 
















'^90 

J A v^ 


lie 


Arg 


Aia 


inr 


Asn 


Asp 


Tl 0 

lie 


Lys 




J J w 










335 




pne 


olU 


Leu 


inr 


VjIU 


Asp 


^^1 11 

olU 


Tl *a 

X xe 


7 A cr 
j4b 










SO 
J 3 u 






Leu 


TV 1 -a 

Ala 


ASp 


CjIU 


Tyr 


aia 
Aia 


Leu 


oin 










job 








Lys 


Arg 


vai 


vjiy 


\ra 1 
vai 


oxy 


irne 


\7a 1 
V aX 








1 ft 0 
J o u 










Val 


Ser 


T At t 

Leu 


T All 

Leu 


Aia 


Tyr 


r*i -^r 
vjiy 


Pro 






J y b 










d. 00 

ffc u u 


Leu 


His 


His 


Val 


Asp 


Thr 


Ala 


Lys 




410 










415 




Gly 


Lys 


Lys 


Asp 


He 


Pro 


Val 


Thr 


425 










430 






Gly 


Asp 


He 


Thr 


Gly 


Asp 


Phe 


Lys 










445 








Thr 


Leu 


Met 


Met 


Leu 


Leu 


Gly 


Glu 








460 










Lys 


Val 


Asp 


Met 


Asn 


Asn 


Asn 


Gly 






475 










480 


Leu 


He 


Leu 


Gin 


Glu 


Ser 







485 



490 



(2) INFORMATION FOR SEQ ID NO: 53: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 954 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...954 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53: 

ATG ATT AAC CAA ATA AAC TTC AAA ACC TCT CAT GGA GGA AGC AGA GAA 4 8 

Met He Asn Gin He Asn Phe Lys Thr Ser His Gly Gly Ser Arg Glu 
15 10 15 

GAA GGC TAC ATA AAC TTC TCG GCC TCT GTA AAT CCT TAT CCA CCA GAA 96 
Glu Gly Tyr He Asn Phe Ser Ala Ser Val Asn Pro Tyr Pro Pro Glu 
20 25 30 

TGG ACT GAT GAA ATG TTT GAG AGG GCT AAA AAG ATA AGC ACC TTC TAT 144 
Trp Thr Asp Glu Met Phe Glu Arg Ala Lys Lys He Ser Thr Phe Tyr 
35 40 45 

CCT TAC TAT GAA AAG CTT GAG GAA GAA CTC TCA GAT CTA ATT GGG GAG 192 
Pro Tyr Tyr Glu Lys Leu Glu Glu Glu Leu Ser Asp Leu He Gly Glu 
50 55 60 

CCA ATA ACT ATA ACT GCA GGA ATA ACA GAG GCA CTT TAC CTG CTT GGA 240 
Pro He Thr He Thr Ala Gly He Thr Glu Ala Leu Tyr Leu Leu Gly 
65 70 75 80 

GTT TGG ATG AGG GGT CGG AAA GTA ATA ATC CCG AAG CAC ACC TAT GGG 288 
Val Trp Met Arg Gly Arg Lys Val He He Pro Lys His Thr Tyr Gly 
85 90 95 

GAA TAC GAG AGG ATC TCA CGC ATG TTC GGA GGT AGG GTG ATC AAA GGT 3 36 

Glu Tyr Glu Arg He Ser Arg Met Phe Gly Gly Arg Val He Lys Gly 
100 105 110 

CCC AAT GAC CCA GGA AAG TTA GCA GAA TTT GTT GAA AGA AAT TCA TTC 384 
Pro Asn Asp Pro Gly Lys Leu Ala Glu Phe Val Glu Arg Asn Ser Phe 
115 120 125 

GTG TTC TTC TGC AAT CCA AAC AAT CCA GAT GGA AAG TTC TAC CGA GAA 4 32 

Val Phe Phe Cys Asn Pro Asn Asn Pro Asp Gly Lys Phe Tyr Arg Glu 
130 135 140 

AAA GAG ATG AAA CCT CTT TTA GAT GCC ATT CAA GAC ACT AAC TCA ATT 4 80 

Lys Glu Met Lys Pro Leu Leu Asp Ala He Gin Asp Thr Asn Ser He 
145 150 155 160 

TTG ATC TTG GAT GAA GCC TTC ATA GAC TTT GTT AAG AAA CCA GAA AGC 528 
Leu He Leu Asp Glu Ala Phe He Asp Phe Val Lys Lys Pro Glu Ser 
165 170 175 

CCA GAG GGA GAG AAC ATA ATC AGG CTA AGG ACT TTT ACC AAA AGC TAC 576 
Pro Glu Gly Glu Asn He He Arg Leu Arg Thr Phe Thr Lys Ser Tyr 
180 185 190 



GGG CTC CCA GGG GTA AGG GTT GGA TAT GTT ATT GGA TTT GTC GAT GCT 624 
Gly Leu Pro Gly Val Arg Val Gly Tyr Val lie Gly Phe Val Asp Ala 
195 200 205 



TTC AGG AGC GTT AGA ATG CCA TGG TCA ATT GGC TCT ACT GGG GTG GCC 672 
Phe Arg Ser Val Arg Met Pro Trp Ser lie Gly Ser Thr Gly Val Ala 
210 215 220 

TTC TTA GAG TTC TTA CTC AAA GAT AAC TTC AAA CAC TTA AGA AAA ACC 720 
Phe Leu Glu Phe Leu Leu Lys Asp Asn Phe Lys His Leu Arg Lys Thr 
225 230 235 240 

CTC CCC CTA ATA TGG AAA GAA AAG GAG AGG ATT GAG AAA GAA TTG AAA 768 
Leu Pro Leu lie Trp Lys Glu Lys Glu Arg He Glu Lys Glu Leu Lys 
245 250 255 

GTT AAA AGC GAT GCA AAT TTC TTC ATT ATG AAG GTC AGA GAA GGA ATA 816 
Val Lys Ser Asp Ala Asn Phe Phe He Met Lys Val Arg Glu Gly He 
260 265 270 

ATT GAA AAG CTA AAA GAG AAT GGC ATC CTT GTA AGG GAT TGC AAG AGC 864 
lie Glu Lys Leu Lys Glu Asn Gly He Leu Val Arg Asp Cys Lys Ser 
275 280 285 

TTT GGA CTC CCT GGG TAC ATA AGG TTT TCA GTT AGA AGG AGA GAA GAG 912 
Phe Gly Leu Pro Gly Tyr He Arg Phe Ser Val Arg Arg Arg Glu Glu 
290 295 300 

AAT GAC AAA CTC ATA AAC ATC CTT AGA AAA ACA CTT AAT ACT 954 
Asn Asp Lys Leu He Asn He Leu Arg Lys Thr Leu Asn Thr 
305 310 315 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 318 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 





(xi) SEQUENCE 


DESCRIPTION: 


: SEQ ID 


NO: 54: 










Met 


He 


Asn 


Gin 


He 


Asn 


Phe 


Lys 


Thr 


Ser 


His 


Gly 


Gly 


Ser 


Arg 


Glu 


1 








5 










10 










15 




Glu Gly 


Tyr 


He 


Asn 


Phe 


Ser 


Ala 


Ser 


Val 


Asn 


Pro 


Tyr 


Pro 


Pro 


Glu 






20 










25 










30 






Trp 


Thr 


Asp 


Glu 


Met 


Phe 


Glu 


Arg 


Ala 


Lys 


Lys 


He 


Ser 


Thr 


Phe 


Tyr 






35 










40 










45 








Pro 


Tyr 


Tyr 


Glu 


Lys 


Leu 


Glu 


Glu 


Glu 


Leu 


Ser 


Asp 


Leu 


He 


Gly 


Glu 




50 










55 










60 










Pro 


He 


Thr 


He 


Thr 


Ala 


Gly 


He 


Thr 


Glu 


Ala 


Leu 


Tyr 


Leu 


Leu 


Gly 


65 










70 








75 










80 


Val 


Trp 


Met 


Arg 


Gly 


Arg 


Lys 


Val 


He 


He 


Pro 


Lys 


His 


Thr 


Tyr 


Gly 










85 










90 










95 




Glu 


Tyr 


Glu 


Arg 


He 


Ser 


Arg 


Met 


Phe 


Gly 


Gly 


Arg 


Val 


He 


Lys 


Gly 








100 










105 










110 






Pro 


Asn 


Asp 


Pro 


Gly 


Lys 


Leu 


Ala 


Glu 


Phe 


Val 


Glu 


Arg 


Asn 


Ser 


Phe 






115 










120 










125 








Val 


Phe 


Phe 


Cys 


Asn 


Pro 


Asn 


Asn 


Pro 


Asp 


Gly 


Lys 


Phe 


Tyr 


Arg 


Glu 




130 










135 










140 










Lys 


Glu 


Met 


Lys 


Pro 


Leu 


Leu 


Asp 


Ala 


He 


Gin 


Asp 


Thr 


Asn 


Ser 


He 



145 








Leu 


He 


Leu 


Asp 


Pro 


Glu 


Gly 


Glu 








180 


Gly 


Leu 


Pro 


Gly 






195 




Phe 


Arg 


Ser 


Val 




210 






Phe 


Leu 


Glu 


Phe 


225 








Leu 


Pro 


Leu 


He 


Val 


Lys 


Ser 


Asp 








260 


He 


Glu 


Lys 


Leu 






275 




Phe 


Gly 


Leu 


Pro 




290 






Asn 


Asp 


Lys 


Leu 


305 











150 






Glu 


Ala 


Phe 


He 


165 








Asn 


He 


He 


Ara 


Val 




Val 


Glv 








200 


Arg 


Met 


Pro 


Trp 






215 




Leu 


Leu 


Lvs 


Asp 




230 






Trp 


Lvs 


Glu 


Lvs 


245 








Ala 


Asn 


Phe 


Phe 


Lys 


Glu 


Asn 


Gly 








280 


Gly 


Tyr 


He 


Arg 






295 




He 


Asn 


He 


Leu 




310 







155 



Asp 


Phe 


Val 


Lys 




170 






Leu 


Ara 


Thr 


Phe 


185 








Tvr 


Val 


He 


Gly 


Ser 


He 


Gly 


Ser 








220 


Asn 


Phe 


Lvs 


His 






235 




Glu 


Arg 


He 


Glu 




250 






He 


Met 


Lys 


Val 


265 








He 


Leu 


Val 


Arg 


Phe 


Ser 


Val 


Arg 








300 


Arg 


Lys 


Thr 


Leu 



315 



160 



Lvs 


Pro 


Glu 


Ser 






175 




Thr 


Lvs 


Ser 


Tvr 




190 






Phe 


Val 


Asp 


Ala 


20^5 








Thr 


Gly 


Val 


Ala 


Leu 


Arg 


Lys 


Thr 








240 


Lys 


Glu 


Leu 


Lys 






255 




Arg 


Glu 


Gly 


He 




270 






Asp 


Cys 


Lys 


Ser 


285 








Arg 


Arg 


Glu 


Glu 


Asn 


Thr 







